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Macroeconomics: a succinct 
introduction 


For many postgraduates macroeconomics is a totally new subject. For 
some undergraduates, it is useful to have a bird’s eye view of the ba¬ 
sics of the subject before plunging into more advanced material. This 
chapter therefore sketches in some basic elements of macroeconomics as 
a foundation for what follows. It takes nothing for granted except a 
knowledge of elementary microeconomics — essentially supply and de¬ 
mand. Macroeconomics is about the behaviour of whole economies but 
since these are made up of individuals and firms acting in the market¬ 
place together with other institutions like governments that are respon¬ 
sible to individuals, it is natural to think of our understanding coming 
from microeconomics which analyses how people interact in a particular 
market and build up the whole economy from there. That is how the 
‘classical’ economists — those who wrote before John Maynard Keynes 
— naturally analysed the economy. While their thinking was for a long 
time overlaid by the work of Keynes and his followers, in recent times 
it has once again become the core of our macroeconomic analysis; we 
shall begin with the early classical thinking, then sketch in the ideas of 
Keynes and of his later followers, before building up the modern macroe¬ 
conomic theory that rests essentially on classical foundations with some 
extra elements drawn from Keynes’ work. This approach, via a short 
history of macroeconomic thought, is the easiest way, experience shows, 
to reach an understanding of modern thought. (Students may like to 
consult a good intermediate textbook such as Parkin and Bade, Modern 
Macroeconomics , any edition, in conjunction with this chapter.) 
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Models of the Economy 


THE CLASSICAL MODEL 

The natural starting point for macroeconomics is the classical model, 
where it is assumed that prices and wages are flexible as in the familiar 
microeconomic model of a market with supply and demand. The differ¬ 
ence from that model is that supply and demand are for the aggregate 
of all agents in the economy — it is assumed that one can meaningfully 
aggregate up or average the behaviour of all in this way; implicitly we 
assume that when income or activity is redistributed across people or 
firms the average is not much affected so that we can focus on the aver¬ 
age forces affecting the economy rather than on their distribution across 
agents. Aggregate supply in this theory depends on technology, prefer¬ 
ences and relative prices. Demand depends on the quantity of money 
and on incomes. 

Supply 

Figure 1.1 shows four quadrants, bottom left is the labour market, top 
left the production function, top right a 45° graph transferring output 
through to the bottom right, which is the supply curve between the price 
level and output. 

The production function, PF , which can be thought of as the optimal 
behaviour of the average firm times the number of firms, takes capital 
as given and hence shows diminishing returns of output ( Q ) to increas¬ 
ing labour input (L). From it is derived the demand for labour, DD, 
where (for the average firm) the real wage equals the marginal product 
of labour, or equivalently the price = marginal cost, the first-order con¬ 
dition of a profit maximum. The supply of labour, SS, is shown with a 
flattish slope, indicating that the substitution effect of rising real wages, 
jy . is substantially larger than the income effect; this could come about 
because either workers substitute effort across time or unemployment 
benefits are generous and long-lasting (as discussed below, respectively, 
in Chapters 3 and 9). 

Two aggregate supply, AS, curves are shown. One is vertical, indi¬ 
cating that if all prices ( P ) rise the supply of and demand for labour will 
not change since wages (offered and demanded) will rise in proportion, 
and so with relative prices the same there is no inducement to alter of¬ 
fers. The second curve is upward sloping; the reason is that as prices 
rise people do not realise that all prices are rising (inflation). Workers in 
particular do not realise this and in this diagram are assumed to be slow 
therefore in raising their wages. Hence as prices rise wages lag behind 
and real wages fall to the right along the demand for labour curve (this 
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Figure 1.1: Aggregate Supply 


can be thought of as a rightward shift of the labour supply curve). This 
generates more employment and more output (this is developed fully in 
Chapter 3, first section, ‘The New Classical Phillips Curve’). 

Demand 

We suppose for simplicity an economy where there is no alternative to 
using cash for paying workers and for buying goods. In this ‘cash-in- 
advance’ transactions technology, we also assume there is a payment 
period, one month for example. Then the economy ‘turns money over’ 
as illustrated in Figure 1.2). 

At the end of the month workers are paid and spend their cash 
through the succeeding month. As cash moves from workers to firms 
the total holding of it remains constant — this is the demand for cash 
for transactions. If we measure money output, PQ, in a year or at an 
annual rate, it follows that cash required is equal to one month’s output 
and so the demand for money, M D = (1/12) x PQ (where 1/12 is the 
fraction of output required as money, K) or equivalently M D x 12 = PQ 



6 


Models of the Economy 


(where 12 is the ‘velocity’, V, of money); respectively the Cambridge 
(UK) and the quantity theory ways of writing money demand. 



Figure 1.2: Transactions Demand for Money 

Begin by supposing that the money supply M equals M°. that is 
there is equilibrium in the money market. Under our assumptions this 
also implies that income equals spending (as in figure 1.2). Now suppose 
the government increases the money supply by 10 per cent (by printing 
more) and gives the extra money to workers at the end of the month. 
At existing PQ( 0) the workers plan normal spending of PQ{ 0) which is 
their normal income and firms need to pay out only PQ(0) in wages; so 
neither workers or firms need to hold on to the extra money for future 
months. The workers will therefore spend the extra money on buying 
goods, driving up PQ along the AS curve. At this point PQ( 1) is higher; 
firms hold more money to pay for extra work. But when M returns 
to consumers, still M s > M D and again AD > PQ(1). Hence the 
process will repeat itself and PQ will rise again. Only when PQ has 
risen sufficiently for M s = M D = -^will we have AD = PQ. This is 
where PQ = MV. This is the ‘aggregate demand’ consistent with the 
stock of money (Figure 1.3). 

In classical thinking prices and wages move rapidly to restore the 
economy to its potential, Q*\ hence interest rates move to ensure that 
saving equals investment — for example, if at Q* people decide to save 
more this would mean more funds would be offered to investors in plant 
and machinery and interest rates (the cost of these ‘loanable funds’) 
would fall until the market for savings equilibrated with investments. 
So we can say that interest rates are not affected by the demand for or 
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Figure 1.3: Aggregate Demand 


supply of money, they are merely a (small) factor in determining the 
demand for money. Figure 1.4 illustrates the market for loanable funds. 

Short- and long-run equilibrium in the classical system in response 
to a monetary expansion can be summarised by Figure 1.3. In the short 
run output rises because wages lag behind, but in the long run output is 
driven back to Q* as wages rise in response to workers’ realization that 
it is general prices that are rising. 

To allow for the effect of government spending, G, and taxation, T, 
we simply note that T — G is part of savings, viz. savings by govern¬ 
ment; while private savings are now PQ — T — C. So total savings are 
T — G + PQ — T — C = PQ — G — C. Hence for example a rise in gov¬ 
ernment spending, taxes constant, would shift the SS curve in Figure 
1.4 leftwards and raise interest rates; it would not raise output even in 
the short run. (It could do so indirectly via the effect of higher interest 
rates on the demand for money, a factor we introduce in the next sec¬ 
tion — by reducing this demand it would shift the aggregate demand 
curve outwards. However this effect is likely to be quite small, since the 
transactions demand for money is rather inelastic.) 

Allowing money to respond to interest rates: the Wicksellian 
mechanism 

We have assumed up to now that interest rates have a negligible effect 
on the demand for money, and that therefore we could treat the market 
for money and bonds as essentially unconnected. But this can only be 
a stringent simplifying assumption. In fact interest rates do affect the 
demand for money because people and firms can put their money into 
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Figure 1.4: Supply and demand for loanable funds 


deposits (short-term bonds or loans, in other words) as an alternative 
to continuously holding the money they will eventually need for trans¬ 
actions. Households can, for example, deposit half their month’s wages 
for the first half of the month and then withdraw them half-way through 
the month in time to carry out the second half of the month’s transac¬ 
tions; firms can in turn deposit their takings from the first half of the 
month and then withdraw them at the end of the month, just in time to 
pay the wages. This is illustrated in Figure 1.5. Clearly people decide 
whether to carry out this deposit strategy by weighing up the cost ver¬ 
sus the benefit: the cost is making two trips to the bank or loan market 
(one to deposit, one to withdraw) per month; the benefit is the interest 
received. The higher the interest rate the more people may decide to 
deposit; for example, households could deposit three-quarters of their 
wages and then withdraw a quarter after week 1, a quarter after week 
2, and the last quarter after week 3, while similarly firms could deposit 
all their first three weeks’ takings and withdraw them at the end of the 
month — making four trips in all for both firms and households. We now 
find that the demand for money falls with higher interest rates, with the 
demand for deposits moving in the opposite direction — imagine Figure 
1.5 with twice as frequent deposits and withdrawals, implying that the 
demand for money is half as much again as in the figure. 

Now that we have linked the two markets the question arises: what 
happens when extra money is printed and, say, distributed to households 
in our previous experiment? Previously we said that households and 
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Figure 1.5: Money demand when money is deposited/witlidrawn once per 
month 


firms would spend any money surplus to their transactions needs. But 
now we would argue that they would put the extra money on deposit 
mainly, perhaps only spending a very small part of it; the reason would 
be that they would naturally wish to spread out the extra spending over 
a long period, given that they can now deposit it and obtain interest 
while waiting to spend it. 

If they do this, then as the Swedish economist Knut Wicksell pointed 
out (Wicksell, 1970), in a way that foreshadowed Keynes’ views of asset 
markets and demand, the depositing of the extra money would act indi¬ 
rectly to create excess demand by stimulating investment and reducing 
saving (stimulating consumption). Before the money injection, savings 
were equal to investment and hence demand for output was equal to 
supply of output: S = I implies that S = PQ — C — G = I so that 
C + I + G = PQ that is demand equals output (as supplied along the 
AS curve). He thought of this situation as one where interest rates were 
at their ‘natural’ level: namely such as both to clear the loanable funds 
market and to set demand for output equal to supply of output. 

But after the injection interest rates fall below this natural level, 
stimulating consumption and investment above these levels, and so cre¬ 
ating excess demand. This would drive up prices or output in just the 
same way as we described before when we did not allow people to de¬ 
posit their money. Aggregate demand would rise until the demand for 
money rose to the new level of money supply, at which point of course 
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interest rates would return to their natural level as the injection of ex¬ 
cess money into the loanable funds market would cease (see Figure 1.6). 
This mechanism of ‘monetary transmission’ via interest rates is close to 
what Keynes proposed and what we think today. 


Excess supply of 
money 



Figure 1.6: Wicksell’s monetary transmission 

We can think of the central bank carrying out monetary policy by 
‘open market operations’ in the bond market which inject more or less 
money into the economy; for example, by buying bonds, the bank sup¬ 
plies extra loanable funds to the the bond market, shifting the SS curve 
rightwards by printing extra money. This drives down interest rates, 
creates a gap between investment and saving, that is, excess demand 
for output. In the following period the extra money has entered private 
sector holdings but being surplus to money demand is put back into 
the loanable funds market, keeping interest rates down until the excess 
demand has driven up PQ sufficiently to raise demand for money to 
equality with the extra supply. 

We can also analyse what would happen if demand for money fell 
say for the reason discussed above where people decide to take advantage 
of the interest rate opportunities from depositing (that is lending) their 
money. The surplus money would go into the loanable fund market like 
the extra money printed by the central bank just discussed; triggering 
the same process whereby PQ must rise until the available money supply 
is willingly held. 

The algebra of the Classical system is: 

Production function: Q = f(L ) 

Demand for Labour: W/P = f l '(L) 
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Supply of Labour: L = g(W/P E ) — P E (the price level expected by 
workers) in the short run is assumed to lag behind the actual price 
level. 

Aggregate Demand: M.V(R) = PQ (this assumes that monetary dise¬ 
quilibrium has already been eliminated by the Wicksell process). 

Loanable Funds: S(R, PQ) = I(R) 


THE KEYNESIAN MODEL 

The ideas of Keynes are best understood as a special case, or a degen¬ 
erative distortion, of the classical model — created by assuming that 
wages and prices are inflexible. Nowadays we try to rationalise this as¬ 
sumption by referring to nominal wage contracts and ‘menu costs’ of 
raising prices. But neither Keynes nor his post-war followers did so: 
they simply pointed to the facts of wage and price rigidity and said that 
non-economic factors (such as conventions of fairness, policed by unions 
among others) could be responsible. 

The simplest framework, which we will use here, assumes that both 
wages and prices are rigid — that is to say, are set by suppliers who 
thereby state their willingness to supply whatever is demanded at those 
rates. From time to time they may change them — in response to excess 
demand or supply, or to general inflation — but that does not alter the 
point that whatever they have set their supply is driven by the demand 
they face. It is natural to assume in the Keynesian model that wages 
are governed by bargaining between workers or unions and firms and 
that prices are set by firms operating in imperfect competition (where 
each firm has a different product and some short-run ‘monopoly power’ 
— that is a downward-sloping demand curve for its product). These 
assumptions imply price- and wage-setting behaviour (whereas perfect 
competition, for example, where wages and prices are given to the indi¬ 
vidual, would not). 

With prices set in this way plainly demand sets output — in mi¬ 
croeconomics we would say that the supply curve is horizontal. Keynes 
(and his co-workers, including Richard Kahn) evolved a theory of de¬ 
mand feedback which he called the ‘multiplier’; the idea is illustrated 
on the ‘Keynesian cross’ diagram, Figure 1.7. Consumption, C, depends 
on income (= output); other demand is treated as exogenous — notably 
investment, /, which is mainly dependent on businessmen’s optimism 
(‘animal spirits’). Without government, aggregate demand, AD = C+I, 
therefore rises with income — the AD curve which slopes at an angle less 
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than 45°. However, output is set by producers equal to demand; hence 
Q = AD — the 45° line. Realistically Keynes thought of this output 
as responding with a lag to demand (initially orderbooks or inventories 
would respond). Output would go on moving until Q = AD , yet of 
course AD would respond (feedback) to changing Q. The two would 
settle down where the two curves intersected (Figure 1.7). 


Q=AD 



Figure 1.7: The Keynesian Cross 

The algebra is: 

Ct = a, + bQt 
AD t =C t +1 
Qt. — AD t -i 

The equilibrium of this, where AD = Q = constant, is found by 
setting Q and AD constant and solving the equations: Q = (a+I)/(l—b). 
1/(1 — b) is the multiplier. 

This set-up can be extended to allow for government: its spending, G, 
is exogenous and its taxation can be represented by a simple proportional 
tax rate, r. This means that in Figure 1.7, I now becomes I + G and 
that the slope of the AD curve is flatter because now extra income 
is partly taxed and does not have such a large effect on consumption. 
Algebraically: 

C t = a + bQ t [1 - r] 

ADt = Ct + / + G 
Qt = AD t ~i 
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The multiplier is now 1/[1 — 6(1 — r)], smaller than before. 

The IS curve 

We now notice that when Q = AD we also have savings (as planned 
- that is consistent with the consumption function above) equal to 
investment (as planned exogenously above) since it implies that Q — C — 
G = I. (Q is income, C is private consumption and G is government 
consumption; it is usual to include government investment in I.) 

In passing, we should not be confused by the fact that in the short 
run savings can be unplanned (for example, when income rises it may 
take time to adjust spending) and also investment can be unplanned (for 
example, when demand falls inventories may increase as it takes time to 
adjust output; the rise in these inventories is included in actual invest¬ 
ment). It is also a confusing fact that by the conventions of national 
accounting actual savings are always equal to actual investment because 
output minus actual spending = rise in inventories, DV by definition; 
therefore Q = actual consumption + G + investment in plant and ma¬ 
chinery minus DV = actual C + G+ actual I. Hence Q— actual C — G = 
actual I. This, however, is an identity with no behavioural significance, 
unlike in the above paragraph. 

In the Keynesian model planned savings = planned investment when 
output has settled down. Thus we would say that output is what drives 
them to equality, not interest rates as in the classical system. What 
then do interest rates do? According to Keynes they affect demand and 
so output — they merely change the output level at which savings are 
driven to equality with investment. (Unlike the classical system where 
output is driven to potential by supply via prices moving, and interest 
rates then drive savings to equality with investment.) Interest rates affect 
demand straightforwardly: as they rise consumption and investment fall 
because of the higher cost of borrowed funds and return to saving. 

The result is a shift in the AD curve in the Keynesian cross diagram; 
we can plot the different Q equilibria as interest rates shift the curve. 
This is the IS curve, that is the curve showing how output changes with 
interest rates given that the goods (output) market is in equilibrium or 
equivalently that planned S= planned I (Figure 1.8). 

The LM curve 

But how are interest rates determined? In the classical system by sav¬ 
ings (supply of funds) and investment (demand for funds) clearing the 
market for loanable funds; but in Keynes’ system savings and investment 
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Q=AD 


AD(R 0 ) 

AD(R-i) 


Figure 1.8: The IS curve 


determine output (also affected by interest rates through their effect on 
savings and investment). But interest rates are not pinned down — the 
IS curve shows that instead they can vary, and as they do so output will 
also vary. So what pins them down? 

In the classical world the market for money is cleared by income in 
the short run and then rather quickly by prices. This does not happen 
in Keynes’ world. Prices are fixed or moving slowly in response to price 
setters’ strategies. Output (income) is responding to aggregate demand 
which has nothing directly to do with the demand for money; it is cer¬ 
tainly not moving to make demand for money equal to its supply. We 
must now ask what is happening in the money market. 

Keynes took over the classical analysis of demand for money above 
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but he also emphasised the alternatives to holding money (defined as 
non-interest-bearing cash): these were short-term or long-term ‘bonds’, 
that is loans. If we divide the world into private and public sectors, this 
meant that the private sector holding money (a liability of the govern¬ 
ment) could also hold public loans as an alternative. The commercial 
banks are assumed to be subject to government regulation (for example, 
by reserve ratios) and their deposits are usually treated as equivalent 
to cash; so cash and non-interest-bearing deposits are ‘money’. Besides 
holding this money for transactions, private people and firms might hold 
it because they wanted to get out of bonds for speculative reasons: if 
interest rates were low (bond prices high), people would expect bond 
prices to fall back and so hold more cash and vice versa when interest 
rates were high (bond prices low). Private sector financial wealth in a 
closed economy is by definition equal to the value of public (and banking) 
sector liabilities, money, M and government bonds, B. 

Holdings of equities are what firms issue to households as paper 
claims to the capital stock which of course households own; these eq¬ 
uities have to be held by someone and so their price moves until they 
are willingly held by households. But note that as they are a liability of 
firms they cancel out within the private sector and hence do not consti¬ 
tute a financial asset for the private sector as a whole. So there is no way 
the private sector can move out of bonds and money into equities; all 
that would happen if they tried is that the equity price would go up to 
frustrate them. That equity price may have implications for the rest of 
the economy (for example, if it is high, it implies that expected returns 
on the capital stock are high and so investment is likely to increase); 
but as far as the market for bonds and money are concerned it has no 
implications of any importance as we shall see. 

The total financial wealth of the private sector, consisting as we have 
seen of money and bonds, has therefore to be held either as money or 
bonds: 


M + B = M d + B d 

Hence when M = M , B = B D . In other words the market for 
money is the mirror image of the market for bonds, as illustrated in 
Figure 1.9). 

Hence interest rates are determined by the ‘stock markets’ in Keynes’ 
model: specifically the markets for government stock (or equivalently the 
‘market for money’). Here we see that as output is driven higher or lower 
by the IS curve (the multiplier process) the demand for money moves 
higher or lower with it for transactions reasons; and this in turn drives 
interest rates higher or lower as the financial markets equilibrate. The 
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Money 


Bonds 



(Supply of M) (Supply of Bonds) 


Figure 1.9: Demand for money and bonds 


resulting relationship between output and interest rates is the LM curve 
(Figure 1.10). 

So Keynes rearranged the relationships in the classical model, cru¬ 
cially allowing output to be driven far from potential because he cut off 
the equilibrating effect of flexible prices. 

The algebra of the Keynesian model is: 

(■ IS)Q = f(G,r) 

determining Q 

(LM)M = m(R, Q; P) 
with P given; determining R. 


THE PHILLIPS CURVE AND THE NEO-KEYNESIAN 
SYNTHESIS: 

We now note that the Keynesian system has a hole — price determi¬ 
nation. For a long time after the Second World War economists were 
not concerned because the assumption that prices moved slowly — at 
or close to zero inflation — stood up. But systematic policy stimulus 
keeping economies at low levels of unemployment produced inflationary 
behaviour that the Keynesian model could not account for. The first re¬ 
action of economists was to extend it to include a ‘disequilibrium theory’ 
of price change. The idea was simple: output was determined in the short 
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Figure 1.10: The LM curve 


run by demand as in the Keynesian model but prices would be changed 
by price setters in response to the gap or disequilibrium between this and 
their supply as measured by output potential, the equilibrium produced 
by technology and preferences. Such a relationship was first estimated 
by A.W. Phillips (1958) for UK data on inflation and unemployment 
going back a century: hence the Phillips Curve. Since unemployment 
and output are closely related through the production function assuming 
some measure of the labour force given exogenously, the Phillips Curve 
can be drawn either with unemployment along the horizontal axis or 
with output along it but with a reversed slope. 

This relationship worked well for the 1950s and 1960s but clearly 
broke down in the 1970s as rising inflation was associated with rising un¬ 
employment. The reason was suggested by Phelps (1970) and Friedman 
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(1968) independently: price setters were aiming to adjust relative prices 
in response to excess demand and so their actual price change would be 
equal to the response to excess demand, plus expected inflation — only 
then would relative prices rise by the response to excess demand. This 
expectations-augmented Phillips Curve is shown in Figure 1.11. 

A further development was a theory of expectations formation rooted 
in rational calculation — ‘adaptive expectations’. In parallel with the 
disregard of inflation had gone a casual attitude to expectations of in¬ 
flation: it was typical to assume that expected inflation was equal to 
current inflation (‘static’ expectations); for some purposes this might be 
altered to ‘extrapolative’ expectations where the recent change would be 
extrapolated into the future. Both were arbitrary. Now economists such 
as Marc Nerlove (1958) and Phillip Cagan (1956) replaced them with 
the idea that expectations would be revised in response to past errors: 

Pt ~ Pt-i = k \Pt ~ Pt-i 1 

where p t = expected and actual inflation. Under certain circum¬ 
stances it is rational to adjust expectations in this manner — in partic¬ 
ular, when people are ignorant about the model and the policy process 
it is a good approximation to a rational learning procedure. 

Finally, in a world of inflationary expectations we must change the 
IS curve: demand now responds not to (nominal) interest rates but to 
real interest rates — that is interest rates minus the expected rate of 
inflation for the period of the loan. The reason is plain: consumers and 
firms borrowing to undertake purchases pay back their loans in money 
that is worth less by the rise in prices, hence the true expected annual 
cost to them of the loan is the rate of interest less the inflation rate they 
expect. (The same would apply if they were paying for their purchase 
by running down their savings — then it would be the real interest they 
expect to sacrifice on the savings deposit.) 

We now have a model of the IS and LM curves to determine de¬ 
mand, output and interest rates in the short run; and a Phillips curve 
augmented by adaptive expectations (Figure 1.11) to determine output, 
inflation and interest rates in the long run, essentially as in the classical 
model. This neo-Keynesian synthesis between Keynesian and classical 
thinking dominated macreconomic thinking in the 1970s and 1980s and 
is still highly influential today. (For further reading about this model 
we recommend Laidler and Parkin (1975) and for mathematical back-up 
on solution methods a textbook on mathematics for economists, such as 
Chiang, 1984). 

The model can be represented by five equations set out in Box 1.1; 
they are respectively the IS, LM and Phillips Curves, the adaptive 
expectations process, and the definition of the real interest rate. The 
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equations are deliberately simplified to the maximum — especially, out¬ 
put and prices are all expressed as logs (hence the assumption is made 
that the relationships are linear, that is additive, in logs) and there are 
no lags except in expectations formation. Notice that in the IS curve 
real interest rates employ inflation expected for the coming period, p) : + i, 
rather than inflation expected last period for this, p%, which enters the 
Phillips Curve. 

This model can of course be solved on a computer with numbers 
assigned to the parameters. But we would like to know its qualitative 
properties as well. There are two main ones: the nature of its equilibrium 
and its ‘dynamic’ properties (that is how it moves from one equilibrium 
to another when a shock occurs). 



Expected Inflation rises in response to gap between 
p and p e =0 (PP shifts up) 


Figure 1.11: The expectations-augmented Phillips curve with adaptive expec¬ 
tations 

Box 1.1 shows how the relevant inflation equilibrium — that is resting 
point — can be found for this model. Box 1.2 shows how the dynamic 
properties can be found. 
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Some notes on the equilibrium 

Equilibrium is where the model economy comes to rest, that is, its en¬ 
dogenous variables attain a constant or steadily moving value. What 
this will be in detail depends on the motion of the exogenous variables 
(it is possible for a model not to have an equilibrium at all if these move 
in particular ways). In our case we will assume that y and d are both 
constant and that fh t is growing steadily; this will allow us to look and 
(as we shall see) find an equilibrium where inflation, expected inflation, 
interest rates (real and nominal) and output are all constant — and 
prices are growing steadily (in line with the constant inflation rate). 

Box 1.1 shows the detailed steps. The meaning of these steps can 
be quickly explained. First, the Phillips Curve with adaptive expecta¬ 
tions implies that the economy must be at y, or on the ‘long-run Phillips 
Curve’ — by which is meant the relationship between inflation and out¬ 
put when expected inflation is equal to (has caught up with) actual 
inflation. This relationship can be understood by a policy experiment (a 
foolish one). Suppose the government decided to stimulate the economy 
and keep it above y permanently: inflation would rise initially along the 
short-run Phillips Curve, but then expected inflation would rise too, and 
inflation would rise further. It can easily be checked that inflation will 
rise indefinitely, hence the vertical curve in the long run — thus illustrat¬ 
ing the point that for inflation to settle down, output cannot permanently 
exceed (or be less than) y. This point is sometimes referred to as the 
accelerationist proposition. Figure 1.12. 

Note that we still do not know where inflation will settle, only that 
output is at y. So secondly we can enquire what inflation will be: this is 
given by the LM curve. If we ask what the implied rate of change of the 
variables in the LM curve must be in equilibrium, we can see that since 
neither interest rates nor output are changing then the rate of change of 
the money supply must be equal to that of prices, that is inflation equals 
the growth rate of money, the fundamental proposition of monetarists 
such as Milton Friedman, Karl Brunner and Allan Meltzer. This gives 
us the point on the long-run Phillips Curve where inflation will settle. 

Given output at y we can establish what real interest rates will be 
from the IS curve: real interest rates must move to equate demand with 
the available supply — the ‘crowding-out’ principle (making space for 
any government demand pressure, d, by reducing private demand). We 
can show this in the IS-LM diagram as follows: clearly in equilibrium 
the IS curve must be located so that output is y at the prevailing interest 
rates, the curve is shifted upwards by expected inflation and rightwards 
by d, hence the point on this IS curve above y will be the implied nominal 
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Figure 1.12: The accelerationist proposition 

interest rate made up of inflation plus the necessary real interest rate — 
Figure 1.13. 

This leaves us last with the price level. Diagrammatically we can 
see that the LM curve must intersect the IS curve at the point that 
lies above y. We must shift the LM curve to this point: given that the 
money supply is on its steady growth path, the only way to do this is for 
prices to change. The LM curve is therefore solved for the price level. 


Box 1.1 


THE NEO-CLASSICAL/KEYNESIAN 
SYNTHESIS MODEL AND ITS EQUILIBRIUM 

lh = d- ar t (IS) (a) 

rn t = p t + 7 y t - (3R t (LM) (b) 

p t = Pt+ <%t — V) (Phillips curve, PP) (c) 

Pt = Xpt-i + (1 — X)pt_i (adaptive expectations, AE) (d) 
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Figure 1.13: Inflation and interest rate equilibrium 


Rt = r t + Pt+i rea l interest rate (RR) (e) 

Remembering that A In x* = —— 

y,p and m are respectively In of real GDP, prices and money supply. 
x t = A x t =x t - x t -i 

Pt = inflation expected last period for this period 
Rt = interest rate (fraction p.a.) 

r t = real (inflation-adjusted) interest rate (fraction p.a.) 

The model’s equilibrium 

Suppose d, y are constant; ifi t is rising at a constant growth rate, 
A frit = hit = fn . Then the model has an equilibrium where 
yt, r t , Rt, Pt, Pt are a ll constant (and p t is growing at a constant 
rate). Find it by setting all these variables to their constant values 
(for p t at its equilibrium value along its constant-growth path) in 
the above equations, and then solving out for these values: 
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AE gives 0 = A p e * = A (p* - p e *) =>• p* = p e * 

PP then gives p* = p* + S(y* — y) ==> y* =y 
IS then gives r* = 

RR gives R* = r* +°p* 

LM, differenced, gives A fh = p* + 7 A y — /3AR* so that as Ay = 
A R* = 0 it follows that p* = Am 
Finally solve for p* from LM as 


Pt = m t - 72 / + (3[r* + p*] =m t -^y J r f3 [-—- + Am] 


a 


This final step tells us an important feature of this model: that when 
the growth rate of the money supply rises the price level will rise by more 
than the money supply level — and vice versa when the money growth 
rate falls, prices fall by more than money supply. The reason for this 
is that the demand for money responds to interest rates: when money 
supply growth increases, interest rates rise and reduce money demand, 
so causing the excess money to be spent until prices have risen enough 
to restore the money demand to equality with money supply. This is 
sometimes referred to as ‘price overshooting’. Consider, for example, a 
hyperinflation caused by a very large rise in money supply growth: this 
causes a ‘flight from money’ as above, which greatly adds to the infla¬ 
tionary pressure in the economy, driving prices to levels much higher 
than the rise in the money supply itself. The contrary holds when the 
hyperinflation is cured, say by a currency reform: then deflationary pres¬ 
sure sets in as people rush back into holding money, so cutting demand 
for goods — this implies that a currency reform should (oddly, it might 
seem) make liberal amounts of the new currency available in order to 
stave off a sharp deflation. Figure 1.14. 

Notes on the dynamic properties 

To understand the nature of the dynamic path we can conduct a policy 
experiment — a simple one-off money supply shock, for example - 
and examine how it affects the economy period by period, simply using 
the model’s equations in a diagrammatic way. So in period 1 the LM 
curve shifts rightwards, triggering higher inflation; this raises expected 
inflation for the next period which shifts the IS curve upwards, giving us 
the period 1 effect marked as 1 in Figure 1.15. In period 2, the Phillips 
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Figure 1.14: Price overshooting 


curve shifts up with higher expected inflation; by now the LM curve is 
shifting back to the left as inflation exceeds the growth of money supply. 
The IS curve also continues shifting upwards with still higher expected 
inflation — giving us point 2 in Figure 1.15. It is easy to continue into 
further periods and show that the effects circle as shown. (This particular 
model in fact requires most unusual parameter values to behave in any 
other, non-circular, fashion, though it can do so.) 

Figure 1.15 also illustrates that all the variables share the same mo¬ 
tion. This follows from the simultaneity of the equations in the model 
— because all the variables react on one another they must obviously all 
move consistently, rather like the waves in a sea must all have the same 
sort of motion since they are all connected to one another. To discover 
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Figure 1.15: One-off rise in nit — dynamics 


the model’s ‘equation of motion’ or ‘characteristic equation’, we follow 
a series of steps shown in the next box. What we are doing there is to 
reduce the model — by simple substitution — to a single equation in one 
variable (as deviations from its equilibrium value) and its lagged values: 
a difference equation (in this case ‘homogeneous’ because it equals zero, 
all the exogenous variables having been eliminated by expressing the en¬ 
dogenous variables in deviations from equilibrium). We can then readily 
show (see Box 1.2) that all the other variables have the same difference 
equation as the variable we have chosen. 

Having obtained this equation we can then examine its behaviour, 
using the standard analysis of difference equations — see Chiang (1984) 
for example. 

Comments on the model and its policy implications 

The type of model we have been examining was the work-horse of macro¬ 
economic modelling in the 1970s (clearly the versions used were often 
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much more complicated, especially with the introduction of lags in the 
relationships). It has ‘monetarist’ properties as we have seen, in that 
inflation depends on money supply growth, and output equals its ‘po¬ 
tential’ or ‘natural’ level, in the long run. But it was also used by 
Keynesians, such as James Tobin, who stressed its short-run properties 
— such as taking a long time to get to long-run equilibrium and being 
prone to generate large business cycles. In a number of interchanges dur¬ 
ing the 1970s Keynesians and monetarists agreed to disagree about the 
likely size of different parameters in such a model and on appropriate 
policies but acknowledged that on the nature of the model itself they 
were essentially in agreement. 

By the end of the 1970s the model was under serious challenge. At 
least as used by most forecasters it had failed to predict the serious in¬ 
flation of the 1970s (associated with the tripling of the oil price in 1973) 
or the recession that went with it. The heart of the objection was the 
mechanism of adaptive expectations which gives the model its circular 
behaviour (as can readily be checked by reworking the model on the 
assumption that, for example, inflation expectations are determined by 
the growth of the money supply); the new ‘rational expectations’ school 
(discussed next) argued that people could do better. By ‘adapting’ they 
regularly made easily-preventable forecast errors: for example, if there 
were a one-off rise in the money supply of the sort we examined above, 
they would first underpredict, then overpredict, inflation and then re¬ 
peat this cycle of errors indefinitely as the economy circled around its 
equilibrium inflation which of course remains precisely unchanged. Such 
expectations behaviour was unintelligent, irrational — hence unlikely — 
and a poor basis for predicting the behaviour of the economy, sensitive 
as this is to expectations. 


Box 1.2 

DYNAMIC PROPERTIES OF THE NEO¬ 
CLASSICAL/KEYNESIAN SYNTHESIS MODEL 
REDUCING THE IS-LM-PHILLIPS CURVE 
MODEL TO CHARACTERISTIC EQUATION 

1. Subtract equilibrium values from each equation, to write 
model in deviations from equilibrium e.g. IS curve: 

6y t = -ar t +d 
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-y* = +ar* - d 

implying y t - y* = -a(r t - r*) 


2. Model in deviation form becomes (all variables in deviations 
from equilibrium): 

Vt = -ar t (IS) (a) 

o = Pt+lVt- f3Rt (LM) (b) 

Vt = Pt+ fiyt (Phillips Curve) (c) 

Pt = Xpt-i + (1 — A )Pt_i (Adaptive expectations) (d) 

Rt = r t + Pt+i ( e ) 

3. Derive Aggregate Demand function from (a), (b), (e). From 
(b) and (e). 

1 -y 

n = +~}Pt + -pVt - Pt+i 

Substitute into (a) and collect terms, implying: 
a a/3 

y = -j t^,’ h + ,AD curve) 

Write (c) as AS curve y t = \{pt~ Pt) an< I equate to AD 


V th - ® 


(3 + aj 


Now (Koyck transformation) subtract (1 — A) x lagged value from 
both sides. (Equivalently, use the lag operator, L, where Lx t = x t ~\. 
So A is a symbol instructing us to lag a variable; it can be treated 
algebraically in the same way as a coefficient. 

Rewrite pf = XLpt + (1 — A )Lpf or pi = j-^jr-\jzPt- Now substitute 
this expression in the above equation and multiply through by 1 — 
(1 — X)L.) 

Through (d) this yields 

\{Pt ~Pt- 1) = —(Pt - [1 - A]pt_i) + °f (Apt) 

o p + ay P + cry 

Since =p t — Pt-i, substitute all through for p t , pt-i, Pt -2 hi place 
of p t , '{h-\ and collect terms to get: 

Pt + apt + bp t -2 = 0 
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where 


a 

b 

s 

r 


— (1 — A + 2s — r) _ 

1 + s + r 
s 

1 + s — r ’ 
f3 + cry. 
a8 ’ 

px 


Notes on difference equations 

Solutions to second order equation x t + ax t ~i + 6 a ; t _2 = 0 is 

x t = A\\ + BX\ 


where Ai and A 2 are solutions of quadratic 


A T aA T b — 0, Ai, A 2 


—a ± Vci 2 — 46 
2 


In case of complex roots (where 4b > a 2 ), the solution can conve¬ 
niently be written: 


xt = AA* cos (6t, — e); A = Vb\ cos 9 


a 

2A 


The length of the cycle is ^21. 

The sufficient and necessary conditions for stability are: 


IT a b T 0 

1 — a T b > 0 

1-6 > 0 


Against this critique, the model’s users claimed that adaptive ex¬ 
pectations were a reasonable approximation when, as was usually the 
case, people had poor knowledge either of the workings of the economy 
or the processes driving policy or both — a neat demonstration of this 
point was made by Benjamin Friedman (1979), no relative of Milton 
Friedman. Then adaptive expectations is close to an optimal learning 
procedure. True, if there were to be just one shock such as a money 
supply rise, then the model would break down as people would soon re¬ 
alise what had happened; but in practice a lot of shocks are continuously 
hitting the economy and so people cannot easily work out what is going 
on: the model would be a good approximation then. 
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If we accept for purposes of argument that the model does make sense 
in some such terms, what policy implications can we draw from it? We 
can identify three main issues: 

1. Rules versus discretion in monetary policy. 

Monetarists argued that the money supply should be kept to a steady 
growth rate as far as possible (rules) rather than being varied in response 
to the perceived state of the economy (discretion). The argument was 
based on the difficulty of knowing the parameters, or even the economy’s 
actual state, accurately — the slogan used by Friedman was that ‘the 
lags were long and variable’. 

If one reacted excessively to the economy out of such ignorance, then 
the cycles one was trying to dampen would get worse, possibly much 
worse. One can examine this point by adding into the model a money 
supply response function, in place of the exogenous money supply we 
have assumed; this will change the dynamic properties of the model and 
depending on how well it is chosen, the model can become more or less 
cyclical. Monetarists argued that the prudent (risk-averse) government 
would prefer to avoid the chance of making things much worse to enjoying 
the possibility of making them much better — and so would opt for a 
rule. 

Against this Keynesians argued that such fears were exaggerated — 
we did know enough from our improving econometrics to steer the econ¬ 
omy along a less boom-bust course. 

Clearly this argument was empirical; it could only be settled by seeing 
how well or badly discretionary policy did. The experience of the 1970s, 
when there was a strong boom in the early 1070s, followed by the oil crisis 
bust, then again followed by a sharp expansion leading to bad inflation 
— did not inspire confidence in the highly discretionary policies pursued 
in that era. 

2. Gradualism 

Friedman argued that when far away from the appropriate rule, policy 
should be shifted back in small repeated steps because a large shift would 
be highly destabilising for the economy. For example, faced with high 
inflation due to past excessive money supply growth, a government that 
suddenly cut money supply growth back to a much lower rate would pre¬ 
cipitate first a huge recession, followed eventually by a large boom. This 
sequence would lead to the rejection of the policy. If, however, money 
supply growth is slowly cut, each step would induce a small recession but 
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after a few steps this would come on top of revival from the first steps; 
while the economy would be making steady long-run progress back to 
low inflation, the side-effects in lost output and unemployment would be 
more tolerable politically. 

This argument is obviously a strong one if this model is correct. How¬ 
ever. it is not clear that even gradual steps would be easy to undertake 
politically because the length of time an economy would be suffering 
from (mild) recession would be very long. Since bad inflations have fre¬ 
quently been ended, this raises questions about the appropriateness of 
the model in examining the curing of inflation: the rational expectations 
school have argued that it is too pessimistic because governments can 
express the national willingness to cure inflation and this message can be 
understood by people in the economy — hence their inflation expecta¬ 
tions will adjust rapidly downwards, and much less recession is required 
to reduce inflation therefore. 

3. Indexation 

Friedman argued that contracts (especially for wages — but also for 
government bonds) should be indexed to prices, for two reasons. First, 
this would reduce the economic damage in markets; there is a loss of 
consumer surplus due to taxes whose true rate is often raised by inflation 
— indexing the objects of taxation, levying the tax on the indexed rather 
than the nominal amount, would effectively lower the tax. Second, it 
would stabilise output around its natural or equilibrium level by speeding 
up the response of inflation to deviations from this level. In other words 
the Phillips Curve would shift more rapidly upwards in response to excess 
demand and drop more rapidly in response to excess supply. Friedman 
was most concerned in practice about curing inflation and so about the 
response under excess supply (brought about by cutting money supply 
growth). He saw indexation as a way to speed up the cutting of inflation 
without slump. 

We can illustrate the point under the extreme case where indexation 
of wages is immediate and prices are a simple margin mark-up on wages 
adjusted for productivity: in this case the Phillips curve will become 

Pt=pt + q(yt - y) 

In other words expected inflation disappears from the equation (be¬ 
cause wage bargainers do not need to forecast inflation any more), en¬ 
tirely replaced by actual inflation under indexation. It is easy to see that 
this gives us a vertical Phillips Curve — any excess supply raises infla¬ 
tion which then raises wages and so inflation ad infinitum. In this case, 
there is no need for gradualism; to cure high inflation, the government 
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merely needs to cut money supply growth at once to its target inflation 
rate, and virtually overnight inflation will drop to it with a minimal cost 
in recession. 

This seems an attractive argument; and yet of course indexation 
cannot in practice be so immediate. If so, it generates instead a Phillips 
Curve with lagged inflation on the right-hand side: 

Pt =Pt-i + q(yt -y) 

for example, where the lag is one period. But as this example readily 
shows this may worsen the instability of the economy. In this particular 
case, the one-period lag is equivalent to adaptive expectations with an 
adaptive parameter of unity; this, as we saw in Box 1.2 above, may 
produce unstable behaviour. The cycle is faster but may be explosive. 

Experience with indexation in the 1970s did indeed reveal that it in¬ 
creased the violence of the economy’s responses to shocks — increasing 
inflation in the upswing and lowering it in the downswing. For example, 
the Heath government of the UK in 1970-74 indexed wages almost uni¬ 
versally and presided as a result over an inflation explosion in response to 
a programme of policy stimulation. It was not clear from this experience 
that it was an advantage in counter-inflationary policy to have index¬ 
ation in place. During the late 1980s when counter-inflationary policy 
became general in the Organization for Economic Cooperation and De¬ 
velopment (OECD) several countries — including Italy and Belgium — 
abandoned indexation at the same time. 

We now turn from this model to discuss rational expectations, the 
hypothesis which spearheaded the challenge to this way of understanding 
the economy. 

Rational expectations: Introductory Ideas 

Expectations are fundamental in economics. Every economic decision is 
about the future, given the existing situation. Even decisions apparently 
only involving the current situation such as choosing to buy apples rather 
than pears, implicitly involve a view of the future; in this case it is a view 
both about future income which underlies the budget one is prepared to 
allocate to apples and pears, and about future prices of apples and pears, 
which dictates whether it is worthwhile storing them in place of future 
purchases. 

By definition the future is unknowable. Economics, the study of 
economic decision making, therefore is concerned with how people deal 
with the unknowable. 

The Liverpool University economist, George Shackle, made this a 
constant theme of his work. In his view (fo example, Shackle, 1958) 
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each individual constructs in his imagination different scenarios or pos¬ 
sible outcomes following on different actions he is contemplating. He 
knows now the pleasure given by the consequences he imagines for each 
course of action. The course which gives him the most pleasure when its 
consequences are imagined spurs him to action. So Shackle envisages a 
hedonistic calculus of the imagination as the basis for economic choice. 

For Shackle the pleasure involved in a course of action will depend 
not only on the ‘expected pleasure’ if the consequences anticipated were 
to occur, but also on the ‘chances’ of them occurring. A fine outcome 
will normally give less pleasure when imagined if it is fraught with un¬ 
certainty. But this will not always be true. For example, for some people 
the same outcome may seem the sweeter if it is accompanied by the chal¬ 
lenge of possible failure. Too easy a victory may not inspire a sense of 
triumph at all and may not attract the decision taker. 

Shackle therefore regards economic decisions as entirely subjective, 
because they are the product of the individual’s imagination interacting 
with the known facts of the past and present. 

Nor does he feel that we can predict how people will act except in 
matters where nature reproduces itself with regularity. For example, 
in inventory control where the product has a reliable distribution, the 
manager can and will stop production when a sample has a proportion of 
defective products higher than some percentage. In such cases Shackle 
says there is ‘uncertainty’; the rules of probability can be applied by 
people and we can infer what they will do from a knowledge of their 
objectives and of the probability distributions. 

For many decisions the major elements in the future outcome are 
not subject to reliable distributions. For example, what the voters of 
Warrington would do in July 1981, faced for the first time in Britain with 
a Social Democratic candidate, could not be described by a probability 
distribution. It was not a regular event. No sense could be given to 
‘the’ probability distribution. Such elements Shackle terms sources of 
‘potential surprise’. In evaluating future outcomes with such elements, 
each person will make his individual assessment and we cannot say what 
he will do without a complete knowledge of his psychology, even if we 
have access to exactly the same facts as he has. 

Shackle’s perceptions appear to be quite devastating to the use of 
statistical models in economics (that is econometrics). For these models 
assert that economic behaviour is predictable and regular. Their im¬ 
plication is that economic agents take decisions about future outcomes 
which are predictable and subject to well-defined probability distribu¬ 
tions. The models are supposed to be derived from assumptions about 
agents’ preferences, technology and the probability distributions they 
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face. 

The keystone of econometrics is the implicit assertion that, in the 
mass, individual decisions exhibit regularity, even though each individ¬ 
ual decision will be quite unpredictable. Suppose we could define the 
probability distribution of future outcomes. Then econometrics asserts 
that there is a ‘typical’ individual who, faced with this distribution, will 
decide in a certain way; we might in particular say that he ‘maximizes 
an expected utility function’, where the utility function is a mathemati¬ 
cal representation of his preferences. Individuals, when aggregated in a 
large sample, will behave like many typical individuals. 

This assertion is supported up to a point by the central limit the¬ 
orem, according to which the distribution of the mean of a sample of 

n random variables each of variance a 2 will have a variance equal to 
2 

— and as the sample n gets larger will tend to normality regardless of 
the distributions of each variable in the sample (see, for example, Hoel, 
1962, p. 145). But although the assertion is supported in this way, it is 
still required that an individual decision can be regarded as a random 
variable with a defined mean, generated by a systematic decision pro¬ 
cess. Shackle denies this, while econometrics asserts it. Furthermore, 
it was supposed in the above argument that the future outcomes had 
a probability distribution. But for this to be the case it is necessary 
that individuals do behave in the aggregate in the regular manner just 
described. For the distribution of future outcomes is the product of the 
interaction of individuals’ behaviour — for example, the distribution 
of future prices and quantities for sugar results from the interaction of 
individuals’ supplies and demands for sugar. 

Thus the linchpin of the whole edifice of econometrics is the postu¬ 
late of regularity in economic nature. That postulate in turn justifies the 
assumption that econometricians face regularities. Whether this postu¬ 
late is ‘true or false’ can only be settled empirically, by evaluating the 
success and failure of econometrics in attempting to apply this basic 
assertion. But many economists regard it (as did Keynes — see, for 
example, Keynes, 1939) as an unfortunate development which has per¬ 
verted policy on many important issues because of a false perception 
that numerical estimates of likely outcomes from different policies can 
be generated. 

Some of the early hopes of econometricians were that econometrics 
would enable complex problems of decision making by governments, 
firms and individuals to be reduced to the mere application of known 
techniques to models of the relevant economic environment. But these 
hopes have been cruelly dashed. Nowhere has this been more so than 
in macroeconomics. Macroeconomic models of the Keynesian (and later 
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neo-Keynesian synthesis) type were built in great profusion in the 1960s 
for most major countries and used, especially in Britain and the United 
States, as the basis for forecasting and policy. Crude forms of ‘opti¬ 
mal control’ were used by governments: typically governments made 
forecasts, using their models, of inflation and unemployment over some 
horizon and then varied policies to obtain better outcomes according 
to these models. By the mid-1970s disillusion with these methods was 
widespread, as the Western world grappled with a combination of high 
inflation and high unemployment which, in general, neither had been 
predicted by econometric models, nor had responded to policy in the 
manner predicted by these models. 

The reaction to this disillusion has been varied, with the result that 
the consensus in macroeconomics that had appeared in the late 1960s 
had abruptly disappeared by the late 1970s, as we have seen in the 
previous section. To some this was a vindication of their scepticism 
about econometrics. Others, however, have searched for reformulations 
of the models which failed in the 1970s. Rational expectations has been 
one major result. 

Rational expectations takes one step further the basic assertion of 
econometrics that individuals in the aggregate act in a regular manner 
as if each was a typical individual following a systematic decision process. 
The step is to assert that in this decision process the individual utilizes 
efficiently the information available to him in forming expectations about 
future outcomes. 

By ‘efficient utilization’ is meant that the typical individual’s percep¬ 
tion of the probability distribution of future outcomes (his ‘subjective 
distribution’), conditional on the available information, coincides with 
the actual probability distribution conditional on that information. 

It cannot be stressed too heavily — since this is a source of repeated 
misunderstanding — that this is an assertion about the ‘typical’ individ¬ 
ual; in other words, it is to be used operationally with reference to the 
aggregate behaviour of individuals. It cannot be falsified by examples of 
behaviour by any actual individual. Clearly particular individuals may 
behave in ways that exhibit no systematic rational behaviour or expec¬ 
tations formation according to normal criteria, and yet if the behaviour 
of other individuals, who contributed a dominant proportion of the vari¬ 
ability in aggregate behaviour, were rational in this sense, this would be 
sufficient to generate aggregate behaviour that exhibited rationality. 

A further point is that, like all assertions in econometrics, it is to 
be judged by statistical criteria which will be relative. Whether a rela¬ 
tionship asserted to operate for the behaviour of a group ‘exists’ or not 
will depend upon how reliable it is statistically, relative to the uses to 
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which it may be put. Such uses in turn may themselves evolve from the 
discovery of new relationships. 

There is therefore no such thing as an objective criterion for judg¬ 
ing whether an econometric assertion is valid. Rather there is a joint 
and interactive evolution of models and their uses. The 1970s crisis 
in macroeconomic policy and modelling described earlier arose because 
the uses to which models were being put required properties that those 
models turned out not to have. Therefore the question that the rational 
expectations research programme has to answer is: can models based on 
rational expectations be developed that are useful in application? If so, 
what are these uses, and how do they compare to the uses available for 
other modelling approaches? 

The assertion (or ‘hypothesis’) of rational expectations in one sense 
vindicates Shackle’s basic perception that expectations are at the centre 
of economic behaviour. For on this view behaviour reacts to expected 
future behaviour which in turn depends on current behaviour; the capac¬ 
ity therefore exists for changes in the environment to affect current be¬ 
haviour sharply, as individuals react to their perceptions of the changed 
environment and its implications for the future. Expectations are there¬ 
fore completely integrated into behaviour. 

In previous theories of expectations, this integration was incomplete. 
Econometric models used ‘proxies’ for expectations of particular types 
of outcomes. These proxies were based on what had been found in the 
past to correlate well with behaviour in a way that could reasonably be 
attributed to the operation of expectations rather than actual events. 
Hence for any particular problem in hand a proxy would be sought ad 
hoc — the most satisfactory formulation on which we have dwelt was 
adaptive expectations. However it is obvious enough that if changes in 
the environment disturbed the previous relationship between the proxy’s 
behaviour and expectations, this would affect behaviour in a way that 
would be inaccurately captured by the effect of changes in the proxy. 
Hence only under the restricted set of circumstances where the previous 
relationship is unaffected will the proxies be useful. Unfortunately, the 
restricted set excludes most of the policy experiments that are of interest 
to governments. 

In another sense the rational expectations hypothesis conflicts with 
Shackle’s vision in that it is an attempt to use econometrics to capture 
the integration of expectations into behaviour. As such, it could be 
rated as even more foolhardy than basic econometrics. For whereas in 
well-settled times, patterns of behaviour might conceivably evolve which 
could give rise to some econometric relationships, in times of change 
when expectations are being disturbed in an unfamiliar way, then surely 
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to model the effect of the changes on expectations and on their interac¬ 
tions with behaviour must be a mad attempt. For there is the capacity 
for immense diversity of imagined outcomes from the changes taking 
place; the ‘typically’ imagined outcome will be a useless construct in 
this diversity, where different individuals will be behaving in unrelated 
and possibly conflicting ways. 

This reaction correctly identifies the way in which rational expec¬ 
tations is a programme for pushing econometrics to the limits of its 
possibilities in the prediction of behaviour. At the same time the re¬ 
action correctly notes the ambition of this attempt. It is possible that 
the attempt is hopelessly over ambitious. If it is proved to be so, then 
at least the limits of econometrics will have been clearly defined. For 
it will be the case that econometric relationships can at best only be 
useful in the restricted circumstances where the environment shows con¬ 
siderable stability. This would imply that they could be used to forecast 
existing trends in an unchanged environment, but not to predict the ef¬ 
fects of changes in the environment, especially policy changes. If this 
were so, then it would equally invalidate attempts to use econometric 
models to design ‘optimal’ policy rules. Hence the implications flowing 
from a proper exploration of the potentialities of rational expectations 
are substantial and important. 

We wrote in our first edition: ‘While our own view is that these 
potentialities are considerable, it will only be a decade or so before we 
will know with reasonable clarity just what can be delivered from this 
approach.’ The view of the economics profession today, judging from the 
widespread application of rational expectations, is that much has been 
delivered from it that is useful; the hypothesis has evolved its own set of 
uses as well as revealing the limits of some traditional ones. 

The early intellectual History of the Hypothesis — an Overview 

As so often in economics, it turns out that early economists propounded 
ideas at different stages that bear a striking resemblance to the hypoth¬ 
esis. For example, Marshall (1887), in his evidence to a Royal Com¬ 
mission, argued that an increase in the supply of money would affect 
economic activity by lowering interest rates, increasing loans, expendi¬ 
ture and finally prices. However, he also added that if the increase in 
the supply of money was well known, then individuals would anticipate 
the consequent expansion in demand and the effect on prices would be 
much faster 1 . 


1 This reference was pointed out to us by Richard Harrington. 
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Modigliani and Grunberg (1954) are credited by Robert Shiller (1978) 
with the earliest post-war promulgation of the ideas behind the hypoth¬ 
esis. However, it was one of Modigliani’s collaborators, John Muth, who 
truly created the hypothesis (Muth, 1961) in the sense that he set it 
down in precise form and showed how it could be applied to specific 
problems. 

Muth’s article was written partly in order to defend the prevail¬ 
ing flagship of the 1960s in expectations modelling, adaptive expecta¬ 
tions, according to which expectations of a variable are an exponentially 
weighted average of past values of this variable. It turns out that un¬ 
der certain circumstances this is the same as the rational expectation 
(see the appendix to chapter 2). These were the circumstances to which 
Muth seemed to draw attention. His other work published at the time 
was exclusively devoted to the use of exponentially weighted averages 
(Muth, 1960; Holt et ah, 1960). 

The use of a particular modelling technique depends as much on 
its perceived tractability by economists as on its inherent plausibility. 
It was in fact the best part of a decade before economists started to 
use Muth’s concept in its own right in applied and theoretical work. 
Adaptive expectations seemed in the 1960s the best tool, partly perhaps 
because it was still new and relatively unexplored (it made its first jour¬ 
nal appearance in Nerlove, 1958), partly because of its most convenient 
econometric transformation (due to Koyck, 1954) into a single lagged 
value of the dependent variable. By contrast, the techniques for using 
rational expectations (RE) were not widely available; the solution of 
models with RE presented difficulties, overcome by Muth in very simple 
models, but not readily dealt with in the larger models being developed 
in that decade. As we shall see, solution of larger models (and also esti¬ 
mation) required substantial computing power; and it may well be that 
the rapid quickening of interest in RE modelling from the mid-1970s has 
been due to the explosion in the capacity of the electronic computer. 

The earliest published work in macroeconomics using Muth’s con¬ 
cept seriously, if only to a limited extent, is that of Walters (1971) and 
Lucas (1972a). Walters showed that the effect of money on prices would 
be substantially quickened by RE. Lucas argued that under RE (un¬ 
like adaptive expectations) monetary expansion could not raise output 
permanently above the natural rate of average (the ‘natural rate hy¬ 
pothesis’), although the responses of money to lagged output and prices 
(feedback responses) could affect the time-path of output in response 
to shocks. Note that neither author in these articles argued for the in¬ 
effectiveness of monetary policy feedback responses for influencing the 
time-path of output. 
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Another early paper was by Black (1973), who applied rational ex¬ 
pectations to the foreign exchange market, an area which has seen many 
further and productive applications of the concept. Some of this subse¬ 
quent work used Muth’s original model in a partial equilibrium treatment 
of the market, treating macroeconomic variables exogenously (for exam¬ 
ple, Minford, 1978; Bell and Beenstock, 1980). Other work (for example, 
Frankel, 1979) has estimated reduced-form models of the exchange rate 
derived from monetary models with rational expectations. Shiller (1973) 
applied it to the term structure of interest rates; here too a separate but 
closely related body of work has been extremely fruitful. 

This has taken the form of tests of the ‘efficient market hypothesis’. 
This is a combination of the rational expectations hypothesis and some 
hypothesis about market behaviour, which thus makes it possible to test 
rational expectations through the behaviour of market prices. Eugene 
Fama (for example, 1970, 1976) and his collaborators at Chicago were 
prolific pioneers in this area, and covered not only financial markets but 
also a variety of commodity markets, with results which at the least 
substantially revised the popular notions of the early post-war period 
that markets were irrational and highly inefficient. 

Work on general macroeconomic applications (that is to inflation and 
output) in the early 1970s was substantially that of Lucas (1972a, b) and 
Sargent and Wallace (1975). Lucas’ concern was to develop the rationale 
for fluctuations in money supply to affect output; his problem being that 
if information on money is available, then movements in money should 
immediately be discounted in prices, as everyone seeks to maintain rel¬ 
ative prices unchanged. He developed a theme due to Milton Friedman 
(1968) that individuals perceive economy-wide data such as money and 
the general price level with an information lag, and are forced to esti¬ 
mate whether the price changes they currently perceive at the market 
level are relative price changes or merely reflect general price movements 
(inflation). A positive relationship between output and money or price 
movements (a ‘Phillips curve’) can occur because of mistakes made by 
individuals in estimating current inflation; they supply more output mis¬ 
takenly thinking that the relative price of their output has risen. 

This ‘surprise’ supply function was an essential component of the 
small-scale macro models used by Sargent and Wallace to illustrate the 
potential implications for policy making of RE. They showed that in 
a ‘Keynesian’ model, if the Phillips curve (a standard Keynesian con¬ 
struct) is interpreted as a surprise supply function, then only monetary 
surprises can affect output; monetary plans, whatever they are, can only 
affect prices. Hence, in particular, sophisticated feedback rules (which, 
for example, raised money supply in response to a poor past output per- 
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formance) would be ineffective in their output stabilization objective. 
However, Lucas and Sargent (1978) have pointed out that this work was 
not intended to imply that monetary policy was in general ineffective 
for output, merely that in a particular ‘standard’ model (which they did 
not in any case endorse) to which ‘optimal’ monetary control techniques 
were routinely applied, such techniques were in fact useless. The lesson 
they drew was cautionary: monetary policy rules should carefully allow 
for the reactions of private agents to the rules themselves. 

Lucas (1975) and Sargent (1976a) proceeded in the later 1970s to 
develop an alternative to the standard model, the so-called ‘equilibrium 
business cycle’ model. In this model the information lag story is main¬ 
tained, but all markets are treated as if they are auction markets in 
which all agents are atomistic competitive units. Households are con¬ 
sumers and suppliers of labour and, period by period, compute optimal 
intertemporal plans for consumption and work based on the price se¬ 
quences they perceive themselves as facing. Firms, on the other side of 
the labour and goods markets from households, similarly compute (given 
their technology) optimal plans for hiring labour and producing goods 
based on the same price sequences. The price sequences that are per¬ 
ceived are the rational expectations equilibrium sequences that clear the 
markets today and are expected to clear them in the future. Because 
firms and households, once they have made a plan, incur ‘sunk costs’, 
there is an adjustment cost in changing plans as these past decisions 
are unwound. This imparts the correlation of prices and quantities over 
time that is a feature of business cycles. But the impulse to the cy¬ 
cle comes from shocks to the economy, whether from monetary policy, 
technological innovations or surprise shifts in household preferences. 

Such a model is by no means mandatory in the rational expecta¬ 
tions research programme. The ‘surprise’ supply function is hard to 
sustain when information about prices is as up to date as it is in the 
modern economy. Also, while the ‘as if’ assumption of auction markets 
and automatic agents may work well for economies with highly com¬ 
petitive structures, monopoly power, in labour markets particularly, is 
widespread in Western economies, markedly so in Europe, and could re¬ 
quire explicit modelling. Furthermore, the role of long term contracts — 
in goods, labour and financial markets — may be inadequately captured 
by this ‘as if’ assumption; this assumption would imply that contracts 
were approximately ‘fully contingent’ (that is, such that prices and quan¬ 
tities altered in response to shocks in just the maimer that optimal plans 
would call for if there were no contracts), yet this apparently is not gen¬ 
erally the case, for reasons that are still not well understood but may 
well be entirely consistent with rationality. Later, ‘menu costs’ (the cost 
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of changing prices, as on a printed menu) were introduced to justify the 
setting of prices in nominal contracts (Parkin, 1986; Rotemberg, 1983). 

Work by Phelps and Taylor (1977) and Fischer (1977 a,b) took non¬ 
contingent contracts in goods and labour markets as given and set up 
simple models to explore the implications. The influential model of an 
‘overshooting’ foreign exchange market in an open economy, by Dorn- 
busch (1976), centred on the interaction of ‘sticky’ contracts in goods 
and labour markets with an auction foreign exchange market. The ef¬ 
fect of changes in financial asset values due to the presence of finan¬ 
cial contracts (such as long-term bonds) similarly played an important 
role in the models of Blanchard (1981) and Minford (1980). Integration 
of labour monopoly power into these models has been carried out by 
Minford (1983). These developments implied a model different in many 
aspects from the narrowly defined equilibrium model of Lucas and Sar¬ 
gent, although it should be stressed that a model with contracts and 
monopolies where agents have rational expectations could perhaps most 
naturally be described as an equilibrium model, in that contracts have 
been voluntarily entered into and monopolies have agency rights on be¬ 
half of their members (and if the state legitimizes a closed shop, their 
non-members too). The policy properties differ substantially; there is in 
general more scope for stabilization, although whether it would be ben¬ 
eficial is another matter, requiring careful welfare analysis (see Chapter 
5). 

There were yet other strands in the ongoing RE research programme. 
B. Friedman (1979) and Brunner et al. (1980) attempted to introduce 
the modelling of learning about the model, notably about the evolving 
‘permanent’ elements in the model’s structure. Research on learning and 
whether it converges to rational expectations equilibrium continued to 
be active (Frydman and Phelps, 1983), although tractability was (and 
remains) a problem. 

Long and Plosser (1983) suggested that business cycles may be mod¬ 
elled without an appeal to information lags or the price supply function; 
they may stem from real shocks to consumer wealth which generate 
equilibrium cycles in consumption, production and relative prices. Cor¬ 
relations between money and output may be explainable by implication 
in terms of reverse causation (King and Plosser, 1981). 

This ‘real business cycle’ approach has the attraction that it relies 
on fewer ad hoc institutional restrictions than the New Classical ap¬ 
proach (where information on money and prices, though in principle 
available rapidly, is slow to arrive in practice) or the New Keynesian 
(where contracts have unexplained nominal rigidities). Kydland and 
Prescott (1982) showed that a model of this type, calibrated to US data 
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from cross-section studies, could replicate the time-series properties of 
the US post-war economy. 

An influential school led by Sims (1980) despaired of modelling the 
economy’s structure, but was interested in tirne-series analysis for predic¬ 
tive purposes within stable structures using the vector autoregression, or 
VAR (the regression of a group of variables on the past of all members of 
the group). McNees (1986) and Litterman (1986) found that such VAR 
models could predict as well as the main US commercial forecasters; 
however, in the UK during the 1980s, VAR models were outperformed 
by the main forecasters (Holden and Broomhead, 1990). Finally, a large 
and growing theoretical literature deriving from the work of Lucas (1978) 
and Wallace (1980) explored the behaviour of artificial economies peo¬ 
pled by representative agents. These models give us insights into the 
role of money, taxes and government bonds in enabling exchanges be¬ 
tween groups that may be separated by geography (as in Townsend’s, 
1980, tribes travelling in opposite directions along a turnpike) or time 
(as in Samuelson’s, 1958, overlapping generations models, for example, 
Wallace, 1980). 

In the rest of this book we develop these themes in macroeconomics 
— in its modern guise macroeconomics is based entirely on the idea that 
agents are rational. Hence rational expectations is central to the subject 
today. 
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Solving Rational Expectations 
Models 


In Chapter 1 we defined the rational expectations hypothesis (REH) as 
the assumption that people’s subjective probability distributions about 
future outcomes are the same as the actual probability distributions 
conditional on the information available to them. In practice we will 
be concerned with moments of these distributions, most frequently the 
mean, but also occasionally the variance, and very rarely the higher mo¬ 
ments (the skewness and so on). When people talk about ‘expectations’ 
in popular discourse they mean some single number for the future out¬ 
come that is expected to occur; but a moment’s thought shows at once 
that this is not a sensible definition of expectations. Since the future 
is governed by chance, this exact number will only occur by chance. A 
better definition would be that ‘the expectation’ is some summary mea¬ 
sure of what may happen, that is, of the probability distribution. Such a 
summary measure would be, for the central tendency of the distribution, 
the mean (the first moment); and for the tendency for dispersion around 
the mean, the variance (the second moment). Then either implicitly or 
explicitly there would be an indication of the asymmetry of the distri¬ 
bution (its skewness or third moment), and its truncation at the tails 
(its kurtosis, or the fourth moment). These moments of a distribution 
over x t are, respectively, Ex t , E(x t — E x t ) 2 , E{x t — E x t ) 3 , E(x t — E 
x t ) 4 and so on for higher moments. Hence ‘expectations’ are shorthand 
for some measure of a probability distribution; in practice we use the 
mean as the main summary measure, assuming that the other moments 
are known in some way — this implies that the main feature of the 
distribution that is changing over time is its mean. 

The mathematical term for the mean of a distribution is its ‘expected 
value’, and it is usual for applied work on the REH to identify the ‘ex¬ 
pectation of x t +i ( x at time t + z)’ with the mathematically expected 


42 




Solving Rational Expectations Models 


43 


value of xt+i- In this book we shall use the notation E t+ jXt+i for expec¬ 
tations framed for the period t + i. on the basis of information generally 
available at time t + j: j, i can be positive or negative. E is the mathe¬ 
matical expectations operator, meaning ‘mathematically expected value 
of’. Formally E t+ jX t +i is defined as E(x t +i | <E>t+i) where &t+j is the set 
of generally available information at time t + j. Of course, once x t+l is 
part of the information set then E t +jXt+i = x t +i trivially. 

If we wish to indicate that the information available to those fram¬ 
ing expectations is restricted to a set 9 t +j at t + j, we shall write 
E t +j ( x t +i | Ot+j ), that is, the expectation of x at t + i framed on the 
basis of information set 9 available at t + j. It is natural to think of 
Et+jXt+i as ‘expectations formed at t + j of x at t + this will do for 
some purposes but it is not quite accurate. It is not in fact the date 
at which expectations are formed that matters but rather the date of 
the information set on the basis of which they are formed. Because of 
information lags, people may form expectations for this period on the 
basis of last period’s information, and we would write this as E t _iXt- 
Suppose for extreme simplicity that the model of Xt is: 

x t+ i = x t + e t +i (1) 

where e* is normally distributed with a mean of 0, a constant variance 
of (j 2 , independence between successive values, and independence of all 
previous events; that is, e* : N( 0, a 2 ), E(e t + £t+j ) = 0 (i ^ j) and 
Es t+i | 4> t+ j =0 (■ i > j). Equation (1) states that x t follows a ‘random 
walk’ (the change in Xt is random). 

The expectation of x t at t is x t if we assume that people know x t 
then. They cannot know £t+i because it has not yet occurred and as a 
random variable its expected value is zero. If we write as the total 
information set at t, then: 

EtXt+i = E[xt +1 | ‘ht) = (Xt + £t +i | &t) = Xt + E(e t +i \ = xt. 

( 2 ) 

E t x t .+i will be an unbiased predictor of x t +i, that is, the mean (or 
expected value) of the prediction error x t +i — E t x t+ i is zero. Thus: 

E(xt+ i — E t xt+ 1 ) = E(x t + £t+ i — x t ) = Ee t+1 = 0 (3) 

E t x t + i will also be the efficient predictor of xt+ i, that is, the variance 
of the predictor error is smaller than that of any other predictor. Thus: 

Variance (x t+ i - E t x t + 1 ) = E(x t+ 1 - E t x t+ i) 2 = E(e t+1 ) 2 = a 2 (4) 

This is the minimum variance possible in prediction of x t +\ because 
St -|-i is distributed independently of previous events (the meaning of 



44 


Models of the Economy 


‘unpredictable’). Suppose we add any expression whatsoever, say j3z t , 
where Zt is a variable taken from <E> t , to E t x t +i , making another predictor 
x t +i- 

x t+ i =x t + (3z t (5) 

Then: 

E(x t +1 - x t+ 1) 2 = E(s t +1 - (3z t ) 2 — a 2 + 0 1 Ez 2 (6) 

The variance will be increased by the variance of the added expression, 
because this must be independent of e t +\. 

The unbiasedness and efficiency of their forecasts are the two key 
properties of rational expectations forecasts that we will constantly re¬ 
turn to in this book. However for the time being, in this chapter, we 
shall restrict ourselves to explaining how the rational expectation of vari¬ 
ables determined in more complex models is to be found, and how those 
models are accordingly to be solved. 



Figure 2.1: A simple macroeconomic model illustrated 


THE BASIC METHOD 

Now take a simple macro model (illustrated in Figure 2.1): 

m t =pt + yt 

Pt = Et-m + 6(y t -y*) 


( 7 ) 

( 8 ) 
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m t =m + e t (9) 

where m, p, y are the logarithms of money supply, the price level, and 
output respectively; y* is normal output, to is the monetary target (both 
are assumed to be known constants). Equation (7) is a simple money 
demand function with a zero interest elasticity and a unit income elastic¬ 
ity: in Figure 2.1 it is drawn as an aggregate demand curve with a slope 
of —1. Equation (9) is a money supply function in which the government 
aims for a monetary target with an error, e t , which has the properties 
of our previous e in (2.1). As the error shocks the economy, aggregate 
demand shifts up and down around D*D*, its steady-state position set 
by to. Equation (8) is a Phillips curve as can be seen by subtracting p t _ i 
from both sides; in this case it states that the rate of inflation equals 
last period’s expectation of the inflation rate plus a function of ‘excess 
demand’. We can think of the ‘periods’ as being ‘quarters’ and prices as 
being set, as quantities change, on the basis of last quarter’s information 
about the general price level - hence we appeal to an information lag of 
one quarter and E t -\ refers to this quarter’s expectation formed (the op¬ 
erative element) on the basis of last quarter’s information. In Figure 2.1, 
equation (8) is drawn as the aggregate supply curve; rising output re¬ 
quires rising prices, given expected prices, because each producer wants 
his own relative price to be higher to compensate for the extra effort of 
higher supply. The vertical supply curve, S*S* , is the long-run Phillips 
curve, indicating that when producers know what the general price level 
is they will not be ‘fooled’ into supplying more output as it rises because 
they realize their own relative price is unchanged. 

This model has three linear equations with three endogenous vari¬ 
ables, two exogenous variables, to and £ t , and an expectation variable, 
E t -iPt- Given the expectation, we can solve it normally, for example, 
by substitution. So substituting for m. t and p t from (8) and (9) into (7) 
gives us: 

to ■ + £t = E t -xp t + (1 + 6)y t — 6y* (10) 

This corresponds to the intersection of the D'D' and SS curves in 
Figure 2.1. But we now need to find E t ^ip i; to get the full solution. 

To do this, we write the model in expected form (i.e. taking expec¬ 
tations at t — 1 throughout) as: 


Et-im = E t ^ip t + E t _iy t 

( 7 e ) 

Et-iPt = E t -ipt + 6(E t -iyt. — y*) 

( 8 e ) 

Et-imt = rn 

( 9 e ) 
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Substituting (8) e and (9) e into (7) e gives: 

E t -ip t =m — y* (11) 

This is the intersection of the D*D* and S*S* curves in Figure 2.1. 
S*S* shows what producers will supply on the assumption that the prices 
they receive are those which they expect (this is an ‘expected supply’ 
curve); D*D* shows what output will be demanded at different prices 
on the assumption that the money supply is ra, as expected — (an ‘ex¬ 
pected demand’ curve). Where these two curves intersect is accordingly 
expected output and prices. 

Equation (11) is substituted into (10) to give the full meaning of the 
p t , yt intersection in figure 2.1: 

Vt = V* + Y^S £t ( 12 ) 

Consequently, from (8) and using (11): 

p t = m - y* + Y^Ts £t ( 13 ) 

The solutions for y t and pt consist of an expected part (y* and m — 
y*. respectively) and an unexpected part (functions of e*). Rational 
expectations has incorporated anything known at t — 1 with implications 
for p and y at time t into the expected part, so that the unexpected part 
is purely unpredictable. 

This model, though simple, has an interesting implication, first point¬ 
ed out by Sargent and Wallace (1975). The solution for y t is invariant to 
the parameters of the money supply rule. Output would in this model be 
at its normal level in the absence of surprises, which here are restricted 
to monetary surprises. If the government attempts to stabilise output 
by changing the money supply rule to, say, 

m t = m - (3(y t -i - y*) + e t (14) 

then still the solution for output is (12), because this money supply rule 
is incorporated into people’s expectations at t — 1 and cannot cause any 
surprises. The only effect is on expected (and so also actual) prices: 

Et-m = m- P(y t -i -y*)-y* = m-y* - —^e t -i (15) 

l + o 


Pt=m- y* 


(3 S 

IT ^- 1 + T+s £t 


(16) 


Note that this will raise the variance of prices around their long-run value 
m — y* by (yyy) 2 u 2 . This is illustrated in Figure 2.2, where we start this 
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Figure 2.2: A simple model with an interventionist monetary rule 


model out from steady state at period t — 1, where E t -\Po = fn — y*, 
Et-iyo = y *; let there be a shock in period 0, £o- 

The solution for (po, ijq) is the same as before. But now the govern¬ 
ment responds in period 1 with a money supply contraction, reducing 
m to m — this shifts the aggregate demand curve to D"D". But 

because producers know in period 0 that this reaction will occur, they 
work out the intersection of their expected supply S*S*, and the ex¬ 
pected demand, D"D", correctly anticipating that p 1 =m — j+gS o — y*■ 
Where these curves intersect is accordingly expected output and prices. 

This of course contradicts the well-known results for models with 
backward-looking expectations whereby stabilization policy by govern¬ 
ment can reduce fluctuations in output, provided the government chooses 
the appropriate monetary target. For example, suppose we had assumed 
in accordance with the popular practice of the 1960s that expectations 
of the price level were formed adaptively. The adaptive expectations 
hypothesis is that: 

x e t - x\_ x = p{x t - 1 - 0 < p < 1 (17) 

or that expectations of x t change by some positive fraction, p, of last 
period’s error. This can be written equivalently as: 

Xt = y-xt-i + (1 - p)x e t _i = pxt-i + (1 - p)[px t -2 + (1 - y)xl_ 2 \ 

OO 

z=0 


t-l-i 


( 18 ) 
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by continuous substitution for xf_ 2 , Xf_ 3 ... 

Substituting p% for E t -iPt in our simple model of (7) to (9) turns 
it into an orthodox dynamic model to be solved by standard methods. 
Equation (8) becomes: 

OO 

Pt = M ^ (! — pTPt-i-i + 6(y t - y*) (2.8°) 

i=0 

We can see that expected prices depend not on the planned money 
supply but on past events (past prices), which were known to the gov¬ 
ernment last period. Consequently the government can plan a money 
supply for this period confident that it will not be ‘frustrated’ by a re¬ 
sponse from expectations. They can set a target to*, such that y t = y* ■ 
This will be a target which accommodates prices at their expected level, 
delivering p t = pf; for (8) a assures us that when p t = pf , y t = y*. By 
(7), when p t = pf and y t . = y*, then: 

OO 

m* = p e t + y* = p (1 ~ P) l Pt-i-i + V* (19) 

i =0 

We now find that the solution for output depends on the deviations 
of money supply from this optimal target: 

Vt = y* + ~ m *) ( 20 ) 

These deviations may be due either to unpredictable errors, e, as in 
the rational expectations (RE) case, or to a policy failure to plan mt at 
to*; in other words: 

m t — to* = e t + m T — m* (21) 

where m T is the actual policy target. But in this adaptive model both 
affect output, whereas in the RE version only e, the error term, does. 
In other words, the monetary policy chosen affects output not, as in the 
RE case, merely the monetary surprise. 

In subsequent chapters we shall be examining this RE model and a 
number of considerably more complex RE models whose properties will 
differ from this one substantially. Nevertheless it is a common feature of 
all these models that there is an important difference between the effects 
of an anticipated and of an unanticipated change in any exogenous vari¬ 
able; by contrast, in models where expectations are formed adaptively 
(or as any fixed function of past data) it makes no difference. This is 
probably the most fundamental result of rational expectations. It is the 
nature of the difference of these effects that forms the detailed study of 
RE models. 
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The method of solution set out above (the ‘basic’ method) will suffice 
for all RE models in which there are expectations (at any date in the 
past) of current events only. To repeat, this method involves three steps: 

1. Solve the model, treating expectations as exogenous. 

2. Take the expected value of this solution at the date of the expec¬ 
tations, and solve for the expectations. 

3. Substitute the expectations solutions into the solution in 1, and 
obtain the complete solution. 

RE MODELS WITH EXPECTATIONS OF FUTURE 
VARIABLES (REFV MODELS) 

R will very often, in fact almost invariably, be the case — in the nature 
of economic decisions which, as we have seen, involve a view of the 
future — that expectations of future events, whether formed currently 
or in the past, will enter the model. For these REFV models, our basic 
method must be supplemented and it can be replaced by more convenient 
alternatives. 

For example, add to our previous simple model the assumption made 
by Cagan (1956) in his influential study of hyperinflation, that the de¬ 
mand for money responds negatively to expected inflation (we can think 
of this as approximating the effect of interest rates on money demand in 
less virulent inflations). Let the model now be: 

m t — p t + y t . — a(E t -ip t+ i - E t -ip t ) (a > 0) (22) 

Pt = Et-m + S(y t - y*) (8) 

m t =m + e t (9) 

We keep (8) and (9) as before. In (22) expectations of inflation in the 
current period are regarded as formed on the basis of last period’s (quar¬ 
ter’s) information; as in (8) we are appealing to an information lag. 

Let us use our basic method and see how it has to be adapted for 
this model. Step 1 (solving given expectations as exogenous) gives us: 

m + £* — Pt + ~ E t _ip t ) + y* — a(E t _ip t+ i — E t _ip t ) (23) 

This is the same intersection ( p t , yt) as in Figure 2.1, except for the 
extra term — a(E t -iPt.+i — E t -ipt) which shifts D D relative to what 
is shown there. 
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To find Et-iPt and E t ~iPt+i we now take expectations of the model 
at t — 1 (step 2) to yield: 

m - y* = (1 + a)E t _\p t - aE t _ x p t+1 (24) 

Equation (24) can solve for E t _\ p t in terms of to, y* , and E t _ip t+ i. 
But this is not a solution because E t -ipt+i is not solved out; we appear 
to have shifted the problem into the future. 

To solve for E t -iPt+i we may lead the model by one period (for 
example, write (22) as m t+ i = Pt+i + Vt+i - a{E t p t +2 - E t p t +i)) and 
take expectations of it at t — 1 as before. This yields analogously: 

to - y* = (1 + a)E t -ip t+ i - aE t -ip t+ 2 (25) 


We have now solved for E t -ipt+i in terms of m, y* , and E t -\Pt +2 
again shifting the problem into the future. This naturally leads us to 
solving for expected values using the method of forward iteration pro¬ 
posed by Thomas Sargent. We write (25) as: 

Et-iPt+i = -r ~-— (Jn - y*) + -^—Et^pt +2 (26) 

1 + a 1 + a 

Substitute successively (forwards) for E t ~iPt+ 2 , E t -iPt +3 and so on 
in (26) to obtain: 


N—l / \ 1 / \ i\ 

= (its) ("-!'') + (tt^) £•-■»'+» ( 27 > 

2=0 X 7 X 7 

Let N —* 00 and assume (as seems natural) that E t _\p t+l is stable, 

/ \ N 

so that Et-\Pt+N —*• its equilibrium as N —> 00 . Since I j I —> 0 
also as N —> 00 the final, remainder, term in (27) disappears and (27) 
becomes: 


N 


-j OO 

Et ~ lPt = (28) 
2=0 

We can reach the same result by using the forward operator, B(B 
is the backward operator that instructs us to lag the variable but not 
the expectations date, unlike L which instructs us to lag both). 

Write (24) as: 


(1 + a) 



Et-ipt = m — y* 


(29) 


It follows that: 
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Et-iPt = 


1 


to — y 


1+al 


-°L- B~ 1 

1 +a 


1 +Q -n 
2=0 


a 


-B~ 


a 


(m-y*) 


= to — y* (30) 


In this particular case, the exogenous variables are constant. How¬ 
ever, Sargent’s method can be generalised; for example suppose that the 
money supplies were exogenously given to us (it might be that each pe¬ 
riod the central bank’s policies are reassessed in the light of a variety 
of current information including bank announcements and the result is 
most simply written down as a new set of projections each time.) The 
model is the same, (22) and (8), except for the omission of (9). 

(23) now becomes: 

m t =Pt + ^{Pt - Et-m ) + y* - a(E t -ip t+1 - E t -ip t ) (31) 

Subtracting from this its expected value at t — 1 yields: 

Pt - Et— \Pi = ——r(m t - .Et_im t ) (32) 

1 + 0 

which tells us that the solution for prices depends on the revision to the 
money supply planned at t plus expected prices. 

To find expected prices take expectations of (31): 

Et-imt - y* = (1 + a){E t -ip t ) - a(E t -ip t+ i) (33) 

from which we obtain using the forward operator: 

.. OO 

Et-iPt = [ t — ^( t ^ - Y E t-imt+i\ - V* ( 34 ) 

1 + a z —' 1 + a 

i=0 

In other words the whole path of future monetary policy foreseen at t — 1 
determines expected prices for t. 

We have now seen how in a rational expectations model the expected 
future affects expectations of the present. Plainly the direction of cau¬ 
sation is from the (expected) future to the (expected) present. We note 
that (24) is a difference equation which had to be ‘solved forwards’ by 
iteration into the future; the present depends on the future via a stable 
root, -+W However, it is possible — though on reflection odd — to 
look at the relationship differently, as one where the expected present 
affects the expected future. Looking at it this way draws attention to the 
possibility that a rational expectations model may have self-generating 
explosive paths or ‘bubbles’. 

Suppose we go back to (24) and (25). We could have carried on in 
this way indefinitely and it is easy to see that we would have obtained a 
series of equations which could be written as a sort of difference equation: 

to - y* = (1 + a)E t -ip t+ i - aEt-iPt+i+i (i ^ 0) (35) 
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This is actually a difference equation in a variable pj + i> defined to be 
Pt+i as expected from t — 1: 

Pt+i+ 1 - - y*) (i ^ 0) (36) 

The solution of this first-order non-homogenous difference equation is 
familiarly: 

Phi = m ~V* + \Pt~ i m ~ (* > 0) (37) 

a 

where m — y* is the equilibrium of pt (the ‘particular’ solution), Th“ is 
the unstable root (note that it is the inverse of the stable forward root 
when we solved the model forwards) and p% — (m, — y*) is the constant 
(determined by the initial value pf) in the ‘general’ solution. Here we 
are solving the model ‘backwards’ from the future to the present in the 
sense that the (expected) future is depending on the (expected) present; 
the same root that was stable when the model was solved forwards is 
now unstable when the model is solved backwards. 

This can be understood from Figure 2.3. Here we have drawn the 
long-run Phillips curve, S*S *, and the aggregate demand curve, D*D*, 
on the assumption that prices are not expected to change (. E t -\Ap t +\ = 
0). On this assumption the expected price level is m — y* as before (see 
Figure 2.1). But we can rewrite the expected solution of equations (22) 
and (9) as E t -iAp t +i = ^(E t _ip t +y* — m) which shows that if expected 
aggregate demand (E t ~iPt+y*) exceeds the expected money supply, m, 
then it must be because prices are expected to rise; and vice versa. So 
to the right of D*D* , prices are expected to rise, and to its left they are 
expected to fall, as shown by the arrows on Figure 2.3. Since output 
is always expected to be y* on S*S*, the possible solution for E t _ip t 
and subsequent E t _ip t+ i are shown by the arrows on S*S*. One such 
solution is shown by the intersection of the D D dashed curve showing 
the expected aggregate demand curve, when prices are expected to rise. 


CHOOSING A UNIQUE DEMAND PATH 

Equation (37) and Figure 2.3 give an infinite number of solution paths 
for pl +i (i ^ 0). For we are free to choose any value of p) we like; the 
model does not restrict our choice. Another way of looking at (37) is to 
say that we can choose any future value for any p^ +i we wish and work 
back from that to a solution for p) : . We could already have guessed that 
this would be so from (24) for, to obtain the expectation of a current 
value, we were compelled to take a view about Et-ipt+i■ Any view of 
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Figure 2.3: The solution expected at t — 1 of a simple REFV model, illustrated 
for an unstable path 


this future will then compel a present which is consistent with it; any 
set of expectations is therefore self-justifying. 

REFV models (that is, the vast majority) would be little better than 
curiosa if they did not carry with them additional restrictions sufficient 
to define a unique solution; for they would merely assert in effect that 
‘anything can happen provided it is expected, but what is expected is 
arbitrary’. Worse still, as (37) illustrates, these paths for events can be 
unstable; in fact, our model here implied that all paths for prices except 
that for which pi = fn — y *, explode monotonically as shown in Figure 
2.4. Thus our particular REFV model would assert that only by accident 
would an equilibrium price level be established, otherwise prices would be 
propelled into either ever-deepening hyperdeflation or ever-accelerating 
hyperinflation, even though money supply is held rigid! (Output in this 
model is always expected to be equilibrium.) While such an assertion 
may appeal to some it has not impressed those who have espoused RE 
models; they have looked instead for additional restrictions. 

We have already hinted at the source of an additional restriction in 
our model by noting the instability of all but one path. It is clear that 
the unstable paths are in some sense absurd. The question is: what 
would prevent them? It has to be the case that behaviour would alter 
in such a way as to prevent them. 

Consider, for example, the path of ever-accelerating hyperinflation 
anticipated fully now (on the basis of last period’s information). People 
deciding how much money to hold for transactions would expect now 
that in so many years they will need truckfuls of money to buy the daily 
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groceries; they would therefore find an alternative means of carrying 
out transactions to avoid the investment in trucks they will otherwise 
anticipate. They would use beans or cows or sophisticated forms of 
barter to replace the old money. Ultimately the old money would not be 
used at all; prices would be defined in the new money, say beans. 

But money has an issuer; it may be a bank or a government. The 
issuer derives profits from people’s use of its money issue, and it will pay 
them to avoid its replacement. This it can only do by stopping any such 
hyperinflation ‘bubbles’ occurring. It turns out that a commitment on 
the issuer’s part to put an end to any such inflation at some point, by 
decreasing the money supply at a sufficient rate to offset any decline in 
real money balances held, will do the trick. For if people expect that 
inflation will stop at some period t + N (at which the bank will ‘step in’), 
then this implies an arresting of the very ongoing process that sustains 
the earlier path. Real money balances desired in t + N will now be higher 
than anticipated in that path, so inflation must be lower in t + N. But if 
this is so, then real money balances in t + N — 1 will be similarly higher, 
so also inflation will be lower then; and so on. The whole path will be 
invalidated. 



Figure 2.4: The solution paths for the price level expected at t — 1, as in 
equation (37) 

In fact we can show this formally by imposing on the difference equa- 



Solving Rational Expectations Models 


55 


tion (37) the condition that : 

Pt+i +1 - Pt+i = 0 {i>N) (38) 

and letting (37) run from N ^ i ^ 0, since t + N is the period when the 
bank’s new regime takes over. Using (37) for i = N, we have: 

rn-y* = p e t+N + a(p e t+N - p e t+N+1 ) = by (38) p e t+N (39) 

By (37) this implies: 

/I - \ N 

m-y* =m-y* + \Pt - (m-y*)\ {^—^J (40) 

or: 

p e t =(m-y*) (41) 

It can be seen that (41) when applied to (37) selects the unique stable 
path for Pt +i so that: 

Pt+i =m-y* (i> 0) (42) 

An analogous argument can be constructed for the path of ever- 
deepening hyperdeflation. In this case people will ‘demand’ infinite 
amounts of money because its return is infinite in the long term; this 
implies that the money will be hoarded and disappear from circulation. 
The bank or government will wish to prevent this (because otherwise 
some other money will come into existence) by issuing money until the 
profit rate on the issue has returned to a normal level, that is the rate of 
deflation is zero. The knowledge that the issuer will go on issuing money 
until this occurs acts to impose the same condition (38) on the model. 

We have constructed verbal arguments to justify the imposition of a 
‘terminal condition’ such as (38) in our model. These arguments appeal 
to forces not explicitly in the model, but which would be brought into 
play by certain types of behaviour apparently allowed for by the model. 
These forces will differ from model to model; for example we may appeal 
to legal controls or supervisory agencies to ensure ‘orderly markets’, or 
to competitive forces 1 , or to precepts upon government itself. But an 
RE model with expectations of the future (REFV model) is incomplete 


1 For example, in the competitive equilibrium model of the labour market of Lucas 
and Sargent, as set out in, for example, Sargent (1979a, chapter 16) the transver- 
sality conditions of households and firms supply the necessary terminal conditions. 
These conditions are necessary for optimality; in other words, explosive paths for 
labour supply and demand are not followed by households or firms because they are 
suboptimal. 
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without some forces of this kind to supply an additional restriction, such 
as the terminal condition here. 

Another way of describing our ‘terminal’ condition would be as a 
‘side’ or ‘transversality’ condition: all these express the same idea, that 
there is an additional restriction on the model, here coming from govern¬ 
ment or central bank behaviour designed to rule out what is from their 
(or society’s) viewpoint an undesirable outcome, in this instance for the 
monetary environment. We will come across other such transversality 
conditions later in this book (for example, on private or government 
borrowing designed to rule out unsustainable and thus undesirable bor¬ 
rowing paths). 

Our terminal condition (38) has the effect in the model here of se¬ 
lecting the unique stable path. For REFV models with such a unique 
stable path like the one here (that is, with the ‘saddlepath’ property, so 
called because any deviation from this path is unstable), the imposition 
of terminal stationarity on the expectations ensures the selection of this 
path. For such models, it is therefore only necessary to specify as a side 
condition on the model that the solutions be stable or stationary; this 
condition is referred to variously in the literature as the ‘stability’ or 
‘stationarity’ or ‘convergence’ condition, or ‘ruling out speculative bub¬ 
bles’ or ‘boundedness’. We appealed to it when we used the forward 
iteration or operator method above; to obtain that solution we had to 
assume that the expected price sequence was stable. 

We have now completed step (2) in our solution procedure, albeit in 
a more complex manner than before; call it step (2'). We proceed to 
step (3) and substitute for E t _\p t , E t -iPt+i into (23). It turns out in 
this model that the solution is the same as for our earlier model, as the 
reader can easily verify. 

We may now review our basic method for solving REFV models: 

1. Solve the model, treating expectations as exogenous. 

2. Take the expected value of this solution at the date of the ex¬ 
pectations. If the model generates a unique stable path for the 
expectational variables, impose the stability condition, and derive 
this solution for the expectations. Do so either by the forward 
iteration or the backward difference equation method. 

3. Substitute the expectations solutions into the solution in 1 and 
obtain the complete solution. 
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BUBBLES 

The significance of the terminal condition that enforces stability can be 
seen by considering ‘bubbles’ or ‘will o’ the wisp’ variables. Suppose 
we had no such terminal condition. Then it can be seen that it can be 
rational to expect at t — 1 any price level for t provided one also expects 
an ever-exploding price level for the future, the ‘bubble’. 

It is possible to add arbitrary variables to the solution of REFV 
models provided they obey certain processes dictated by the coefficients 
in the model’s future expectations (see, for example, Canzoneri, 1983; 
and Gourieroux et al., 1982). 

For example, take the model we have been looking at, of (22), (8) 
and (9). Suppose people believe at t — 1, for no good reason, that prices 
would be affected by (fits)* E t ~iZt+i where: 

Et-iZt+i = zt -i (43) 

(that is, Zt is a martingale). Their belief, though ‘irrational’, would 
formally be validated by the model, since 

Et-m+i = (m- y*) + ( 1 + a yE t - 1 z t +i (44) 

a 

is a solution to the model, as can be verified by substituting (44) and (43) 
into (24). Any ‘will o’ the wisp’ variable, z t ~ i, could therefore produce 
an irrational solution to an REFV model by this self-validating process. 

This is simply an implication of the indeterminacy of pi we noted 
earlier in commenting on equation (37); so we can write pi — (m — 
y*) = /i t _! where is anything. However, the solution to the bubble 
problem is one and the same as that of the indeterminacy and instability 
problem: we have to impose an additional restriction on the model to 
ensure determinacy and stability. Since the exploding bubble must at 
some point violate the terminal condition, the whole path collapses back 
to the unique stable solution. 

This terminal condition approach to ruling out bubbles is similar in 
effect to McCallum’s suggestion (1983) that a ‘minimum set of state 
variables’ (MSV) be imposed on the solution; that is, one eliminates as 
many state variables as possible from the solution while still maintaining 
consistency with the rational expectations constraint. In effect the min¬ 
imum set excludes any such extraneous variables that enter as bubbles. 
We would argue that the justification of imposing MSV lies in the op¬ 
timizing transversality conditions on the agents in the model. However, 
of course in practice the procedures deliver the same solution. 
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OTHER METHODS OF SOLUTION FOR REFV 
MODELS 

Not surprisingly there are several other methods for finding the unique 
stable solution to an REFV model which has one. We shall explain two 
in detail because they have been widely used: the Muth method of unde¬ 
termined coefficients and the Lucas method of undetermined coefficients. 

The Muth Method of Undetermined Coefficients 

The Muth method starts from the proposition that the general solution 
of our model can be written (via the Wold decomposition — see the 
time-series annex at the end of the book): 

OO 

Pt=P+^2 n i £ t-i (45) 

i=0 

OO 

yt=y+^2 ( 46 ) 

i =0 

where p, y are the equilibrium values of p t , y t . Note that this way of 
writing the solution assumes that there are no expected future exogenous 
variables or else that they can be entirely substituted out in terms of 
current and past events. This implies that the Muth and Lucas methods 
are not entirely general (in particular it means that the forward root is 
solved backwards and so appears in its inverse, unstable, form), but they 
are useful for the wide class of models where this assumption is valid. 

Let us focus on the solution for p t , since that for y t follows easily 
enough, y = y* and p = rn -y* by setting E t -ip t = E t -ip t +i =Pt =P 
and yt = y in the model. 

Having found the equilibrium in terms of the constants, we now drop 
these from the model and define (p t , y t ) in deviations from equilibrium. 
The model can now be written in terms of p t as: 

£t = (1 + ^)pt + {a - i)E t _ip t - aEt-iPt.+i (47) 

Using (45): 

OO 

t T t e t -i (48) 

i =o 


OO 

Et-lPt n i £ t-i 

i=l 


(49) 
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oo oo 

Pt+ 1 =y t niEt-i -1 n i+ ie t -i (50) 

i =0 i= 1 


Ef-lPt+l (51) 

1=1 

Equations (49) and (51) follow from (48) and (50) respectively because 
Et-ist = E t -i£ t +i = 0. 

Substituting (48)-(51) into (47): 


£t - 



ot ~ T J 7r i £ t-i + a ' ,T i+l £ t- 


i=1 


i=1 


= 0 
(52) 


Each e t _i can be any number so that (52) can hold if and only if the 
set of the coefficients on e t , on e t _i, on e t -2 ■•••> each individually sums 
to zero. These sets must satisfy: 

1 - ^) 7r° = 0 (53) 


(on £ t ) 1 - 


(on e t -i,i ^ 1) - (1 + a)iTi + auq+i = 0 (54) 

Equation (54) is a homogeneous, difference equation in ni with the 
same root as (27) above, and an analogous solution: 

< 55 ) 

Note that the forward root here is being ‘driven backwards’ artificially. 
In (55) again we see that there are an infinity of solutions chosen here by 
selecting 7Ti arbitrarily and that only one is stable, namely that where 
7ri = 0, which of course stops the forward root operating backwards in 
an unstable manner. 

Invoking the stability condition we set = 0, so that 7q = 0 (i ^ 1). 
From (53) we obtain no = jng- Our solution in p t is therefore: 

Pt=m-y* + (^-^j e t (56) 

as before. 

The Muth method becomes unwieldy for larger models where there 
are several errors like e t for each of which a sequence of coefficients must 
be determined, but it is often convenient for small illustrative models, 
and we shall use it frequently for this purpose. 
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Lucas Method of Undetermined Coefficients 

A variant of the Mutli method of undetermined coefficients has occa¬ 
sionally been used (for example, Barro, 1976; Lucas, 1972a) whereby the 
solution for the endogenous variables, instead of being written in terms 
of the constants and the errors, is written in terms of the ‘state’ variables, 
that is, current and past values of the exogenous variables (including the 
error terms of the model equations) and past values of the endogenous 
variables. (It therefore is like the Mutli method in assuming that ex¬ 
pected future exogenous variables can be reduced to current and past 
events; it, too, therefore drives the forward root backwards.) The need 
to include all the state variables can make this method unnecessarily 
complicated, as the example of this model shows. 

Write the solution for p t (on which we focus) as 

p t = 7Ti£ t -I- n-iPt-i + Kzyt-l + 7T 4 £t-i + 7 r 5 m + 7r 6 y* (57) 

We have: 

m + e t = p t + \(pt - E t _ip t ) + y* - a(E t -ip t +i - E t -ip t ) (58) 
o 

Use (57) to generate E t -ipt, E t -ipt+i and substitute for these and 
Pt in (58), obtaining: 


m + £ t = 7 T t £t + TT 2 Pt-l + 7T32/ t _l + 7T 4 £ t _l + 7T 5 TO + 7T 6 y* + -r(7Tl£ t ) 

0 

+ y* - a[(7T2 - l)(7T 2 Pt-l + 7T 3 7/t —1 + 7T 4 e t „l + 7r 5 7n + 7T 6 y*) 

+ TT 3 y* + 7r 5 m + 7r 6 y*] (59) 


We used E t -iyt = y* in this, from the Phillips curve. Now by the 
same argument as with the Muth method, the terms in each of the state 
variables must equate. So we have: 

(terms in e t ): 1 = 7 t 4 + ^7r 4 
o 


yielding: 



6 

TT^ 


(60) 


(terms in p t ~i) 0 = 7r 2 — a(7r 2 — 1)7 t 2 = 7r 2 (l + a) — an 2 (61) 

from which there are two solutions for 7r 2 = 0, i±2i. Of these, 

(the forward root again being artificially driven backwards) violates the 
stability condition and is ruled out, leaving 7 t 2 = 0. 
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(terms in yt-i) 0 = 7r3 — a( 772 — l)7r3, implying 713 = 0 (53) 

(terms in e t _i) 0 = 774 — 01(772 — 1 ) 774 , implying 774 = 0 (54) 

Given these solutions, the terms in m and y* yield 775= 1, 776 = 
— 1. Hence we have obtained, if by a somewhat round-about route, the 
solution for p t : that for y t follows simply using the Phillips curve. 

Clearly the method of solution is a matter purely of convenience. We 
have discussed several methods, all of which have been extensively used 
according to the problem and tastes of the problem solver. All have 
their advantages and disadvantages and are worth the reader’s while to 
understand. 


THE TECHNIQUES IN APPLICATION: A MORE 
COMPLICATED EXAMPLE 

We now use a slightly more elaborate REFV model (with a unique stable 
solution) to illustrate further the application of these solution methods. 
The model here is ‘fully dynamic’, that is to say it returns to its steady 
state gradually after a shock rather than immediately as our previous 
models did. As such it is a prototype for many macro models used in 
practical analysis. 

We retain our Cagan-style money demand equation but date the 
expectations at t for convenience in the money market 2 . We also retain 
our simple money supply equation (9); but we allow for adjustment 
costs in the response of output to unexpected price changes (our Phillips 
curve). So now we have a new model: 

m t =p t +yt -a(E t p t+ i -p t ) (a > 0) (62) 

yt-y = ^{Pt -Et^xpt) +n{y t -i -y ) = -j -— : _ L — (63) 

mt =m + e t (64) 

where we have used the backward lag operator, L, in rewriting (63) to 
facilitate our subsequent operations. 


2 This dating of expectations implies that agents in the money and bonds markets 
have access to all current information whereas those in the goods and labour markets 
only have access to last period’s — not a set-up with much theoretical appeal. Exactly 
what information which agents have is discussed carefully below (especially in the 
following chapters, 3 and 4). Here we make this assumption merely to illustrate our 
techniques with less complication. 
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Basic Method 


Let us apply our adjusted method, focusing on the solution for p t . Step 
1 gives, substituting for (63) and (64) into (62): 

m + e t = (1 + a)p t +y* + j — , El ~ lPt ' 1 - aE t p t+1 (65) 

o 1 — pL 

Rearranging and multiplying through by (1 — pL) yields: 


{m - y*)( 1 - p)+£ t - p£ t ~i = -aE t p t+ i + 


a 


Pt + 



E t ~iPt ~{p + ap)Pt ~i 


( 66 ) 


Notice that the lag of E t p t+1 is E t _ip t and not, for example, p t or 
Et-iPt- 

We now move to step (2'), where we must find E t p t+ i and E t _ip t+ i. 
Accordingly, first we take expectations at t = 1 to obtain: 


(to - y*)(l - p) - p£ t - 1 = 

- aEt-iPt+i + (1 + a + ap)E t -ip t - (p + ap)p t -i (67) 

and: 

(m-y*){l-p) = -aEt-xpt+i+x + (1 + a + ap)E t _ 1 p t+i 

-{p + ap)E t ^ 1 p t+ i-i (i ^ 1) (68) 

The solution of (68) is: 

Et-iPt+i = (m - y*) + Ai ^ + A 2 p l (i > 0) (69) 

where A and B are determined by the initial values E t -ipt+i, E t ~ipt- 
However, we have only one equation (67), to determine both E t -iPt+i 
and Et-iPt, so that there is an infinity of paths, all but one unstable (this 
situation of a unique stable path from which movement in any direction 
is unstable is known as the ‘saddlepath’ property). Impose the stability 
condition, then set Ai = 0 with the result that A 2 = E t _ip t — (to — y*) 
so defining: 

^t-iPt+i = m, - y* + [E t _ip t - (to - y*)\p (70) 

We can now use (67) to solve for E t ~ipt as: 

Et-iPt = (m-y*)(l-p) - 

1 + a 


£t -1 + PPt-1 


(71) 
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We can infer immediately from (71) that: 

EtPt+i = (m - y*)(l - /i) - -r^—£t +m (72) 

This can be verified by leading (67) one period, taking expectations at 
t, and repeating the operations in (68) to (71) but advanced one period. 

We have now completed step 2' and proceed to step 3, substituting 
Et-iPt and E t p t+ 1 from (71) and (72) into (66), to obtain after collecting 
terms: 


p t = (m — y*)(l — p) — y— e t -i + pp t 
+ rv 


-! + ■ 


1 + a — ap 


(73) 

This model and its solution are illustrated in Figure 2.5. The initial 
shock to demand, e t , shifts the aggregate demand curve out to DD along 
the SS, short-run Phillips, curve. The position of DD takes account of 
E t p t + 1 , the expected value of next period’s price level. This expectation 
solution is found by locating the unique stable path (the analogue of the 
algebra in equations (69) to (71)). The D*D* curve shows the combina¬ 
tions of (p, y) for which prices are not expected to change: the equation 
of the LM curve (62) is written as E t Ap t+i+1 = =A(m-E t pt+i - E t y t+i 
(i ^ 1). The S*S* curve shows the combinations of (p,y) for which 
output is not expected to change: the Phillips curve, (63), is written as 
EtAy t+ i = (p — 1 )(E t y t+ i — y*) (i ^ 1). 

The arrows show the implied motion of (p, y) where they take val¬ 
ues off these curves; the line with arrows pointing along it towards the 
steady-state equilibrium at the intersection of D*D* and S*S *, is the 
saddlepath, the unique stable solution. E t pt+i jumps from p t on to this 
line at the point where it intersects S'S', the expected vertical Phillips 
curve given by the gradual adjustment of y t back to y*; going through 
this point accordingly in an aggregate demand curve, D'D', whose equa¬ 
tion is m- E t y t+ i + aE t pt +2 = (1 + a)E t pt+ 1 - D’D' looks forward to 
EtPt +2 which can be found in a similar way as the point on the sad¬ 
dlepath intersected by E t yt+ 2 - Accordingly (p,y) are expected to con¬ 
verge at the rate p on (m — y*. y*) along this saddlepath, after their 
initial shock to ( p t , y t ). 

The basic method using the Sargent forward operator approach 

We proceed as above up to (68). Sargent now rewrites (68) as: 
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Figure 2.5: The solution of a fully dynamic model when a shock, St, disturbs 
equilibrium 


-1 


a 




D 


-1 


1 H - CK 


a 


■V 


1 “h Ct 


a 


/.iB 


Et-lPt+i 


= B 


7-1 


1 “I ~ Qi 


a 


(1 — pB)E t _ip t+ i 


1 T (x 


a 


1 - 


1 + a 


Now we can write (74) as: 

1 \(m-y*)( l-/z) 


B 1 (1- /j,B)E t _ip t+i (i ^ 1) (74) 

= (1 - nB)E t _ip t+ i (i ^ 1) (75) 


1 + aJ 

If we impose stability, this yields setting i = 1: 

E t _ip t+ i = pE t _ip t + (fn- y*)( 1 - /z) 


(76) 


which yields the rest of our solution as before. 

The Sargent method thus represents a convenient extension of oper¬ 
ator techniques to REFV models. ‘Backward roots’ (entering because of 
lagged adjustment) are projected backwards, that is, kept in the form 
1/(1 — pB), ‘forward roots’ (entering via expected future variables) are 
projected forwards, that is, transformed to 1/ ^1 — j^B^ 1 ^-, this pro¬ 
cedure, under the stability condition, gives us the same result as before, 
but in a very compact manner. 
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Sargent’s method is particularly useful for dealing with delayed sho¬ 
cks which are nevertheless anticipated from a date before they occur; 
so far we have considered only contemporaneous, unanticipated shocks. 
But, for example, it may become known now that the government plans 
to raise the money supply sharply in two years’ time for some reason to 
do with anticipated public finance difficulties. 

To allow for such a possibility let us in (64) allow e 4 to be a shock 
which may be related to previous events, whereas before it was assumed 
to be unrelated. Now moving through the previous steps of our solution, 
we find that (66) is the same. Taking expectations at t — 1, however, 
yields: 


(m - y*){ 1 -p) + E t _ l£t - list-! = -aE t _ 1 p t+1 

+ (1 -I- a + a/j,)E t -ip t - (p + afi)pt-i (77) 


and so: 


(to - y*)( 1 -p) + E t _iE t +i - pEt-^t+i-i = —aEt-iPt+i+i + 

(1 + a + ap)E t _ip t+ i - (p + a p)E t _ 1 p t+i _ 1 (i > 1) (78) 

Sargent’s (74) now becomes: 

- -(to - y*)(l - p) - -(1 - pB)E t - i£ t +i = 
a a 

(l-^^B^j(l-pB)B- 1 E t _ lPt+i (*> 1) (79) 

And (75): 

(to - y*)( l-p) + 

= (1 - pB)E t -\p t +i (i > 1) (80) 

The left-hand side of this can be written: 

^ OO 

(in — y*)(l — p) + ——-— ( a /l + o:)'(E t -is t+ i + j — pE t -is t +i-\+j) 

+ a 3=0 

which implies for the case of i — f the solution for: 

Et-ipt+i = pEt-iPt + (to - y*)(l - p) + 1/(1 + a) 

OO 

[a/(l + a)f (Et-\£t+j+\ — pE t -i£t+j) (81) 
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We can also use (77) to solve for E t -ipt as: 


Et-iPt = (m - y*)( 1 - n) + — E t -i£ t - 1 f £t -1 + MH -i + 

1 + a 1 + a 

^ 00 / \ J + 1 

TT^ 5Z ) ( E *~ i £ ^+i - ^-i et +*) (82 ) 

i=o N ' 

Now we see that the future shocks foreseen at f — 1 for t + j enter 
the expected solution for p t with a coefficient of 



Shock g t occurs at t=3, anticipated at t=0. Numbers show the solution at each date, t=0, 
1 , ... 


Figure 2.6: The effect of an unanticipated shock 

Hence the forward root is ‘thrown forwards’, acting as a weight on 
the foreseen shock which diminishes the further ahead the shock occurs. 
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This is illustrated in Figure 2.6 for a positive demand shock antic¬ 
ipated at time t for three periods ahead. At t (= 0 on the figure) the 
expected future shock to demand raises prices unexpectedly, increases 
supply (a movement along SS ), and stimulates demand (a shift in DD) 
because the rise in future prices relative to present prices reduces the de¬ 
mand for money, so increasing money expenditure. Demand continues 
to rise in t = 1 and t = 2 because future prices exceed current prices by 
a greater amount in t = 1 than t = 0 and in t = 2 than in t = 1; this 
is dictated by the dynamics (shown by the phase arrows) in that part 
of Figure 2.6. At t = 3, the shock occurs and increases demand further. 
Prices in t = 4 are expected to drop to exactly where they would have 
been (along the saddlepath) had the original shock to demand been an 
unanticipated one at t = 0 sufficient to stimulate output by the same 
amount as the anticipated shock did; so in t = 3 , there are conflicting 
forces on demand, the positive effect of the t = 3 shock more than offset¬ 
ting the effect from the expected future decline in prices. It is a useful 
exercise to assign numbers to the parameters and plot the resulting path 
as in figure 2.6. 

Muth method 

The Muth method, though not particularly intuitive, is probably the 
easiest to apply for this model. The general solution for p will be as 
before 

OO 

Pt = P + ^2 (83) 

i=0 

and it remains that: 

p = m — y* (84) 


Box 2.1 

The algebra for the illustration works out as follows if for convenience 
we set = Pt-i = y t - 1 = £t- 1 = e t = m = y* = 0: 

Pt = L 01 1 ] E tPt+ 1 from (65) 

1 + a + ^ 

EtPt +1 = (t^—)(t" 7—) 2 (f ~ P-rj—) £ t +3 + m from (82) 

1+a l+o l+o 

led by one period and taking expectations at time t so that: 
p t = qx t and E t p t+ 1 = (1 + pq)x t 
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where 


1 ,, a 


x < = ( t+7 ( t+ 7 (I - 'W £t+3 ' 


q = 77-rr 

1 “h CX — CXfl + 


Again from ( 82 ) 


7-1 7-1 1 CX r 9 1 H" 

^Pt+2 = pE t pt+ 1 H-= [/i + fi q H-Jz t 

a a 

1 -(- Qj 

E t Pt +3 = l*E t p t+ 2 d- X t = 

a 

[m 2 + M 3 9 + m(^-^) + (i_L^) 2 ] iTi 

a a 


EtPt +4 — P'EtPt +3 — M 


m 2 + m 3 9 + m(^) + (^) 2 - 

f l+tt \ 2 7 1+a \ 

\ a / \ 1+a—a/i y 


7+t+5 = VEtp t+ 4 


Both £+ t+ 4 and E t pt+ 5 are negative. 


We now substitute in (66) dropping constants to obtain the identities 
in the £t_$ from: 



( 85 ) 

( 86 ) 


(et-i): — A* = —a7T2 + (1 + a + ap)n\ — (p + ap,)iro 

(£t-i,i Js 2): 0 = -a7Ti + i + (1 + a + ap)n * - (a + a+7r,_i 
Applying the stability condition to the solution of (88): 

—) +A 2 p i - 1 (i> 2) 

a ) 



( 87 ) 

( 88 ) 

(89) 
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sets A = 0, so that: 

7 Ti = 1 (i > 2) 

Substituting this into (86) and (87) gives: 

1 + a — ay 

(1 + ck)( 1 + c* + | — ay) 


(90) 


(91) 


n = _ zR _ 

6(1 T a)(l A a ^ — Qy) 

We can easily verify that this is the solution arrived at previously. 


(92) 


A MORE GENERAL WAY OF LOOKING AT AN 
REFV MODEL 

We have been considering models with exogenous (‘forcing’) processes 
— here the money supply — that consist of a constant and a current 
shock. For most of the time we have assumed that the shock could 
not be predicted; here the unstable root was ruled out by our terminal 
condition. Then we looked at the case where one period’s shock was 
known some periods in advance; here we showed that the unstable root 
determines how this shock works back to affect the present — in effect 
the root is ‘thrown forward’ and becomes stable when working backwards 
from the future. It is time to generalise the solution method we have 
been using to any sort of exogenous process. 

To illustrate such a general method we take a variant of our earlier 
models: 


m t =Pt + yt- a(E t -ip t -i - Et-ipt) (93) 


Vt = y* + S(pt - E t -ipt) + y(y t -1 - y*) (94) 


which will be recognised as a Cagan-style money demand function with 
inflation expectations dated at t — 1 and a New Classical supply curve 
with persistence. Now let m t be an exogenous process of a completely 
general sort: each period there is a new realization of m t and a new set of 
E t m t+ i{i ^ 1). We will make no restrictions on how this set (m t , E t+i ) 
changes at each t. 

Consider the expectation at t — 1 of this model: 



a 

1 + a 


B- 


E t -im t = (1 + a) 


l 


Et-iPt + E t -iy t (95) 
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Tp * | c (Pt —1 Et —1 Pt —l) fnr\ 

Et-iyt = y + p6 - _ - (96) 

Let p t — Et-ipt = Vu we can easily solve for r/ t as 

Y^Ts( m t -E t _ x m t ) =T] t (97) 

by taking deviations from expected values across the model; this is a 
function of the purely unpredictable element (the innovation) in m t . 

Returning to our model above we can write the solution for E t _\p t 
as: 


Et-iPt 


where 


E t -im t * 

(1 + aXl-^i?- 1 ) V 

_ rfVt- 1 _ 

(1 + a ) (l-T TZB- 1 ) (1 -tiL) 

1 °° / \ i 

= ( 98 ) 

i=0 v 7 


Et-iyt+i = y* + psY^^Vt-i-j+i = y* + y l+1 (yt-i -y*Y, 

j=i 

note that the expectations of future innovations are by definition zero. 

Thus our solution automatically throws the term in a forwards (be¬ 
cause it relates the present to expected future events) and the term in 
p backwards (because it related the present and the expected future to 
past events). We can see that the general solution has a forward and a 
backward component, for each of which one of the roots of the model is 
appropriate. 

The model requires, for stability, that both | \-< 1 and | p |-< 1; 

both of these would be imposed as a matter of specification normally. 
Looking back over our previous discussion of uniqueness and will o’ the 
wisp variables, we can also see that in these conditions the terminal 
condition will both ensure uniqueness and rule out bubbles. 


Forward and backward roots: an examination 


We can see that in the model we have been using — with slight differences 
in dating of the expectations — we have obtained an equation of the 
form: 


—cuB + (1 + o, + exp ) — (1 + a)/if?] Et~\pt+i (99) 
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If this is solved backwards we write it as: 

... = -a 1 - + m) )B - )^ 2 B-'Et.m+i (100) 

The roots are then plainly l-^ 2 - and p,. These we recognise as the 
forward root, jyy, solved backwards and so inverted, and the backward 
root, n- 

If the difference equation is solved forwards we write it as: 

... = -p(l + a)\l-(+ -) B- 1 - (-?-) -B-A BE t . lPt+z 
\ 1 + a /z ) \l + aj p 

(101) 

The roots are then y and jyy which we recognise as the forward root 
solved forwards and the backward root, p, solved forwards and so in¬ 
verted. 

Plainly to obtain the appropriate solution as permitted by the ter¬ 
minal condition, we solve the forward root forwards and the backward 
root backwards as we have seen, obtaining: 

• •• = (1 + ct) (l — -—-—B (1 — P B)E t -iPt-\-i (102) 

V 1 + a J 

Equivalently we can obtain this appropriate solution by factorising 


— aB 1 + (1 + a + ap) — (1 + a)pB = fc 0 (l — k±B x )(l — k 2 B) = 

- kokiB^ 1 + ko( 1 + k\k 2 ) - kokoB (103) 
where the undetermined coefficients are given by 

k 0 h = a; jfeo(l + k\k 2 ) = 1 + a + ap-, kok 2 = /z(l + a) (104) 
Solving (96) for k\ yields: 


a 1\ , 

y— - 1 -) k\ 

1 + a pj 


T ^~)- = ° 
1 + a J n 


(the characteristic equation of the model solved forward) whence k\ = 

a 1 

1 + CK ’ ' 

Alternatively solve (104) for k 2 to obtain: 


o , 1 + a . 1 + a. 

k 2 - (- V p)k 2 - (- )p = 0 

a a 


(the characteristic equation solved backwards) yielding k 2 = P- 
To find the stable solution we select the stable values of each root, 
forward (k\). and backward (k 2 ). 
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STABILITY PROBLEMS IN RATIONAL EXPECTA¬ 
TIONS MODELS 


When we considered adaptive expectations models we were concerned 
about whether they were stable or not. Clearly these were backward¬ 
looking models and this question amounted to whether the roots, all of 
them backward, were stable; if they were not, then we would naturally 
assume the specification was wrong, since we look for models that are 
stable, reflecting what we take to be a stable reality (if it were not, 
it should have exploded — yet it hasn’t). With rational expectations 
models stability, as we have seen, involves in the two-root case both 
the forward and backward roots being stable when driven respectively 
forwards and backwards. If either is unstable, there is instability (either 
expected future events produce unstable current effects or past events 
produce unstable expected future effects). Let us consider them in turn: 
first, the case where the forward root is unstable — commonly known as 
the uniqueness problem. 

The Uniqueness Problem: the unstable forward root 


When the forward root is unstable, it is easiest to see what is happening 
in the simple model of equations (22), (8) and (9) at the beginning of 
the section on REFV models. Let us suppose that for some reason a in 
our model is negative and < —0.5. Suppose, for example, that there is a 
rigid relationship of money to average transactions in a period; and that 
precautionary transactions demand is positively related to the rate of 
inflation, because of the irregularity of price changes and the correlation 
between the size of these changes when they occur and the inflation rate 
(for example, I go to the doctor and find he had just put up his price by 
30 per cent). This is implausible but not impossible. 

Here we have, taking expectations at t — 1, looking at the solution 
forwards: 


m — y* 
1 cx 



—-— B~ x 

1 T cx 


Et-ipt 


(107) 


Plainly, the forward sum(m — y*)/ ^1 — jf^B does not converge. 
Equally if we look at the equation backwards we have: 


m — y* 
a 



E t -ip t +i 


(108) 


from which it follows that: 


1 + a 


E t -ipt+i = m — y* + A 


a 


(*D) 


(109) 
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where A = E t -ipt — (m — y*) 

This has a multiplicity of stable paths since | | < 1 : there is 

no unique stable path, hence the ‘non-uniqueness’ label of this case. 
Previously we used the stability condition to choose the unique stable 
path. However, now all the paths in (109) are stable, as shown in Figures 
2.7 and 2.8, because we have rigged it so that |i^| < 1. The stability 
condition is incapable of selecting a unique solution, therefore. This 
problem was first pointed out by Taylor (1977); and so far as we know 
there is nothing to rule out the possibility that REFV macroeconomic 
models will have an infinity of stable paths. 



Figure 2.7: The uniquness problem in (pt,Vt) space 

There is no generally agreed procedure among those using REFV 
models for this problem, other than to avoid using the ones with this 
property. One solution has, however, has been suggested by Minford et 
al. (1979) — to impose a terminal condition as we do in a normal model. 
Needless to say any ‘solution’ must somehow do violence to the model 
as specified since it is literally unstable. However, the economy may be, 
for some peculiar but genuine reason (as exemplified above), like this at 
least for a while: then could the terminal condition remove undesirable 
price level instability, as it did in the same model in its normal set-up? 
It turns out that it does indeed impose a unique stable solution. Thus 
we set: 


Et-lPt+N — Et-lPt+N+l 


(HO) 
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Figure 2.8: The uniqueness problem in ( pt,t ) space 

Using the forward solution we find that this implies 
tTT — y* ex 

- — = Et-lPt+N — - 7 —. - Et-lPt+N +1 ( 111 ) 

1 + a 1 + a 

whence E t -\Pt+N =m — y*. 

Via backwards recursion we obtain: 

E t _ lPt+N _! - 1 ±^E t _ lPt+N = (112) 

a 1 + a 

whence E t _ip t+i =m — y* (i ^ 0). 

Using the backwards solution, our terminal solution implies : 

+m ~ y * =A {~^r) +m ~ y * (H3) 

which is strictly valid only when A = 0. Thus also: 


Et-ip t+ i =m- y*(i ^ 0) 

Note that just as in the case of our normal model, a bubble can be 
added to this solution, namely Et-\Zt+i, but in this case it is an 
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‘imploding bubble’. The terminal condition rules this out here, as it did 
the exploding bubble of our normal case. The justification for such a 
condition might seem strained in this case. Yet upon consideration it is 
equally justifiable. Non-uniqueness (forward instability and imploding 
bubbles) must cause quite as serious problems as backwards instability 
and exploding bubbles. For the endogenous variables may in each pe¬ 
riod jump by unpredictably large (strictly unbounded) amounts; even 
though they will subsequently be expected to return to equilibrium, in 
all subsequent periods there will be shocks with infinite variance. Such 
uncertainty would be likely to provoke changes in behaviour sufficient to 
create an incentive for the money issuer to make a commitment such as 
is set out in the terminal condition. This commitment would then limit 
the uncertainty as we have seen, to that associated with the ‘most stable’ 
path — a result much in the spirit of a suggestion by Taylor (1977) that 
the least variance path will be selected by ‘collective rationality’ 3 . 

Let us now apply this same approach to the more complicated model 
we used above: 

m t =pt + y t - a(E t _ip t+ i - E t _ x p t ) (114) 

Vt=V* + $(pt - E t -ip t ) + p(y t _i - y*) (115) 

m t = ffi + e t (116) 

This yields a reduction in terms of prices and expected prices of: 


(m - y*)( 1 - p) +e t - pe t -1 =Pt~ ppt-i - aE t -ip t+ i + apE t _ 2 Pt 
+ 6{p t - Et-m) + aEt-ip t - apE t - 2 Pt-i (117) 

Let us assume, as we shall see we must, that |Ltsi| >- |/z|. First solve 
the model forwards and impose the terminal condition to obtain: 

Et-iPt+ N =rn-y* - p N+1 (y t _i -y*) (118) 

The backwards recursion proceeds: 

Et-m+N-1 = -^—E t -ipt +N + T~—i m ~ v* ~ 1 - y *)] 

1 + a 1 + a 

(119) 


3 There have been other suggestions, like Taylor’s, as to how society would select 
such a path. Peel (1981) argues that the monetary authorities will select a feedback 
rule generating uniqueness; however, it is not clear that they do select such rules in 
practice. McCallum (1983) argues that the solution chosen, when framed according 
to the Lucas undetermined coefficients method, will contain only the minimum set of 
state variables (his MSV procedure). This, we would argue, would be a result of the 
sort of government commitment we refer to. In practical terms MSV and terminal 
conditions deliver the same solution. 
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ultimately reaching: 


Et-ipt = m — y* — 

. N 
ay 


1 -b a 


1 


\ a 


(*) 


N—l 


l+o; J 

-SUL + 1 

1+c* ^ 


N—2 


n(yt-i-y*) ( 120 ) 


which, letting N be large (where for convergence |1^| y |/x|) 

-»*> < 121 > 

The same result is obtained using the backward solution; take expec¬ 
tations of (117) at t — 2 to obtain: 


— (1 —//). ,1 + a . 

-— (m - y ) = E t - 2 Pt+i - (-b p)E t - 2 Pt + 

a a 

p(^)E t _ 2Pt _ 1 (122) 

a 

whose solution is: 

Et-2Pt+i — m ~ 2/* + -^l(-)* +1 + 7l.2M I+1 (* ^ 1) (123) 

a 

with initial values E t - 2 pt and E t - 2 Pt-i- 

The terminal condition forces the constant on the root with the high¬ 
est modulus (that is, 1^) to be zero, whence: 

E t ~ 2 p t +i =m-y* + [Et- 2 Pt-i -(m- y*)K +1 (i > 0) (124) 

whence also: 

£ t _ip t+i+ i = m-y* + [Et^pt - (m — y*)]p l+l (i ^ 0) (125) 

Substitute for E t _ip t+ i from (125) into (114), take expectations at 
t — 1 of (114)—(116), and reduce to obtain: 

(m-y*) 

= E t _ip t + p{y t _i - y*) - a[(l - p)(m - y*) + pE t _ lPt ] (126) 
whence: 

Et-iPt = m-y* - 1 - -p(y t -i-y*) (127) 

1 + a(l — p) 


as with the forward solution. 
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There is no solution if -< |/z|. (123) now sets A 2 = 0 which 

gives: 

Et-iPt+i = (1- )(m -y )-\ - Et-ipt 

a a 

Repeating the operations which gave (127), we now find that: 

Et-iPt = -(yt -1 - y*) 

a 

implying: 

Et-m+i = —(E-t-i.yt - y*) = —(yt -1 - V*) 

a a 

When this is substituted again into (114)-(116) we obtain: 


Et-m 


m — y* 
1 -{- a 


+ 


m(i - p) 

1 “h OL 


(yt-1 


-y ) 


(128) 


There is therefore no solution for E t -iPt by contradiction. (If |"4r 1 | = 
\fi\. the terminal condition cannot impose a unique solution and (120) 
does not converge; so again there is no solution.) 

What we have seen is that provided is not too unstable (ie. 


provided that 


(X 

1+a 


-< 


1 


), the terminal condition will force a stable 


solution and rule out implosive bubbles. 


The case of an unstable backward root 


We can illustrate this case with the model we have just used. Here \/i\ >- 1 
in which case output is expected to explode — clearly an inadmissible 
model in general since the backward sums Set/ [(1 + <5)(1 — pL)\ do not 
converge. 

In this case, too, however, there is a reason for seeking some sort of a 
solution. Such a model cannot be ruled out for an episode (for example 
moving between two stable models in a general non-linear model - 
as discussed in the supply-side chapter a propos of virtuous and vicious 
circles and more generally in the Time-Series Annex). We therefore need 
to face up to the problems for inflation and monetary policy in such an 
episode. We have justified our terminal condition as a restriction placed 
on behaviour by the monetary authorities to prevent undesirable price 
outcomes. So here we ask if a terminal condition will produce acceptable 
price behaviour. 

We can in fact simply use our workings in the previous section, while 
noting here that a/(l + a) -< 1. Again we require for a solution that 
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(1 +a)/a y |/t|: that is, that here the backward root not be too unstable. 
Thus if a/(l + a) y 1 (our last case of an unstable forward root) then 
(1 + a) /a -< 1 -< \p\ and so there can be no solution with a terminal 
condition. 

However with (1 + a)/a y |/i| we obtain the same solution as above, 
namely: 

E t - 1 p t+i+ = m-y* - /L +1 1 + —^ ^ (y t -i - y*) (1^0) (129) 

This shows clearly that a terminal condition will solve for a price 
path, that it will be unique, but that it will be unstable matching the 
instability in output — we might label this as ‘controlled price instabil¬ 
ity’. 

What we have shown about models with stability problems is that, by 
introducing a terminal condition justified by monetary policy reactions, 
we will in many cases — where the instability is not too severe — find 
unique solutions: if the instability is too severe, there will be no solution 
at all under a terminal condition. In our particular model with two 
roots, ‘not too severe’ means (1 + a)/a y \/i\ . This implies that if 
both forward and backward roots are unstable there is no solution, since 
(1 + a)/a -< 1 -< |ju|. More general models with more roots have to be 
examined case by case if they have instability of either sort, using the 
analytical techniques we have described in this chapter. 


CONCLUSIONS 

This has been a chapter designed to equip the reader with the tech¬ 
niques to solve rational expectations models in a manner useful to ap¬ 
plied work 4 . We have shown how to use four main methods of solution: 
a basic method, both with the Sargent forward operator and with the 
model solved backwards, and the Muth and Lucas undetermined coef¬ 
ficients methods. We have also discussed the criterion for choosing a 
unique solution in these models, free of extraneous or ‘will o’ the wisp’ 
variables. The criterion we propose, namely that terminal conditions 
are imposed on the model (some external ‘transversality condition’), is 
widely accepted in practice The effect of this condition is to ensure a 
stable path free of extraneous variables or bubbles. Practical methods 


4 There are a number of descriptions of solution methods available in the literature 
(see, e.g., Shiller 1978; and the useful Aoki and Canzoneri, 1979). For more complex 
applications than those considered in this chapter, the reader will invariably use 
numerical methods on the computer. 
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of solution used vary (see for example Wallis et al., 1985). However all 
of them are approximations to the analytic bubble-free solution we have 
been setting out above. 
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APPENDIX 2A: WHEN ADAPTIVE 
EXPECTATIONS ARE RATIONAL 

Suppose that a series is generated by the ARIMA(0, 1, 1) process (see 
time-series annex at the end of the book): 

Vt = Vt-i +Ut — jut-i (1) 

where u t is a Gaussian white noise process and j a positive constant. 
The rational expectation E t _iy t of (1) is given by: 

Et-m = yt-i - ju t -i (2) 

From (1): 

yt - yt-i = (i - jL)ut (3) 

where L is the lag operator. 

Substituting for u t -i from (3) into (2) we obtain: 

Et-m = yt -1 i—- T —) 

1 ~]L 

so that: 

Et-m - jEt-m-1 = yt -1 - m -2 - j(yt -1 - yt- 2 ) ( 4 ) 

Subtracting E t _ 2 yt- 1 from both sides of (4) and rearranging we ob¬ 
tain 

Et-m - Et-m-x = K {y t - 1 ^ E t - 2 yt-i} ( 5 ) 

where k = 1 — j 

Notice also from (5) that: 

ny t -1 = (1 ~ tyyt-i 

^ lVt ( 1-(1 -k)L) {l-XL) 

= (1 - A)(l + XL + A 2 L 2 + A 3 L 3 + ... +)y t _ i (6) 
where A = 1 — k so that A = j. 

Equation (5) will be recognised as an adaptive expectations process. 
In other words if a variable is described by an ARIMA(0, 1, 1) process 
then adaptive expectations can be rational expectations if the coefficient 
of adaptation is equal to 1 — j. 

More generally we should note that a variety of mechanistic forms of 
expectation formation can be rational in a particular model structure. 
For instance regressive expectations are rational in the Dornbusch over¬ 
shooting model considered in Chapter 14. The point is, of course, that 
these mechanistic expectations mechanisms will, in general, cease to be 
rational if policy regimes change. 

Two interesting examples where adaptive expectations are rational 
are in models proposed by Sargent and Wallace (1973) and Mutli (1961). 
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The Sargent Wallace model of a hyperinflation 


The demand for real balances has the form: 


log 



07 rf + 7 Y + ip + u t a < 0 , 7 > 0 . 


( 6 ) 


where M is the demand for nominal balances (assumed equal to supply), 
P is the price level, 7 r| is the public’s expectation of future inflation, so 
7Tf = E t logPt +1 — log Pt , is assumed known. Y is real income assumed 
constant and Ut is a stochastic error term with an average value of zero. 
a, 7 and ip are parameters. 

Taking the first difference of ( 6 ) we obtain 


Mt = K t +a(ir e t -Tr^i) 


( 7 ) 


where ji t = log ( j 1 ^ is the rate of change of the money supply and 

77 = log is the rate °f inflation. It is assumed that u t — u t - 1 = r] t 

where r) t is a white noise error. 

Adaptive expectations (here current expectations of future inflation) 
can be written here as: 

e _ l 1 - A ) 7r * 

4 1-XL 


Substituting ( 8 ) into (7) we obtain the solution for 77 as: 


{[1 + a(l — A)] — [A + o:(l — A)]Z/}7T( 

= (1 - \L)nt - (1 - AL)(1 - L)u t (9) 


so that the current inflation rate is determined by distributed lags of 
changes in the money supply and of the disturbance in the demand 
function. 

To provide a rationalization of how adaptive expectations could have 
the rationality property, suppose the rate of monetary expansion is gov¬ 
erned by the process: 


Mi = 


(1 - A)7 i t 
1 - XL 


+ £t 


( 10 ) 


where e t is a white noise disturbance. 

Substitute from (10) into (9) for ji t to obtain 


{[A + a(l — A)] — [A + a(l — X)]L}nt — (1 — XL) [gy — (iq — ut- 1 )] 

( 11 ) 
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Equation (11) can be rewritten as 

[A + a(l - A)] (1 — L) 7 r t = (1 — XL) [e t - (u t - u t _i)] (12) 

Now 1 -xl = 1 — so that (12) can be rewritten as: 


7T t 


= [l l X) l t L l + [A + a(l - A )]- 1 [e t - (u t - u t - 1 )] (13) 


Recalling that Ut — lit -1 = Vt. the rational expectation of (13) is given 


by: 


Et-mt = ^-7 1 implying that E t ir t+1 = < = ^(14) 


l-XL 


1-XL 


In other words if the money supply process is (10) the adaptive ex¬ 
pectations will be rational. The question is why should the money sup¬ 
ply process follow (10) in a hyperinflation. It is typically assumed that 
in a hyperinflation period the authorities print money to finance their 
nominal expenditure, assuming the level of real government expenditure, 
G, is constant. This assumption is captured in continuous time (where 

*= f) by: 


M(t) 

Pit) 


= G 


or 


M __ PG 
~M ~ ~M 

In continuous time the demand for real balances is given by: 

M 

p=/[" e (*)] 

so that substituting (16) into (15) gives 


G 


M 


M /(tR) 


(15) 


(16) 


(17) 


Assuming a unique solution to (17) a linear discrete-time approxima¬ 
tion to (17) is given by 


Mi — E t TT t+ 1 + £f 


(18) 


where £t is assumed to be a white noise error term. 

Assuming rational expectations (10) is equivalent with (18) (since 
Et”t + 1 = 
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Consequently Sargent and Wallace have demonstated how adaptive 
expectations can be rational in a hyperinflationary period. 

We also note from (7) and (18) that 

EtTTt+l + Ct = 7Tt + Ot{lTf — 7rf— x ) -f (j) t (19) 


or 


tt* — 7T t + a{n e t - rf-i) + <t> t — £t 
If expectations are adaptive: 

e _ C 1 - 

‘ 1 - XL 

Substituting (8) into (20) for 7r®, 7r^_ 1 we obtain 
( 1 ^ A ) 7r * Inti n( 1_ A ) 7r t 

Tar =,r ‘ + “(i - L) -rnsr 


h~£t 


Simplifying (21) we obtain 

TT t - 7v t —i = - [(A + a(l - A) _1 (l - A L)(v t )\ 


( 20 ) 


( 8 ) 


( 21 ) 


( 22 ) 


where v t is the composite white noise error, vt = <f> t — £t 

The process (22) is an ARIMA(0, 1, 1) as required for adaptive ex¬ 
pectations of inflation to be rational. 


Agricultural prices 


The adaptive expectations assumption was widely employed in modelling 
of agricultural markets (see Nerlove, 1958) since models embodying this 
assumption were readily able to generate stable cyclical fluctuations. 
It is useful to illustrate how an agricultural model embodying rational 
expectations can also readily exhibit such fluctuations in price or how 
price series for storable commodities can approximately follow a random 
walk. 

We employ the model of Muth (1961). We write the model as follows 
(all means are put to zero for simplicity): 


►C5 

a, 

II 

1 

5? 

(23) 

q t = iE t _ m + u t 

(24) 

It = a(E t p t+ 1 -p t ) 

(25) 

qf + pit = q t + pit— i 

(26) 
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where qf, qf are quantity demanded and supplied, respectively, p t is 
price, I t is a speculative inventory and u t is an error process; a, (3, 7 and 
p are constants. I 11 this model it is assumed that storage of a commodity 
is possible, and that storage, transactions costs and interest rates are 
negligible. Consequently, a speculative inventory exists which depends 
on the anticipated capital gain from holding the stock. The parameter 
a, which measures the response of inventory demands to expected price 
changes, is a function of the degree of risk aversion and the conditional 
variance of prices in Muth’s exposition (also see Turnovsky, 1983). 

Because storage can occur, equilibrium does not require that current 
production (supply) equals current consumption demand. Equation (26) 
represents the market equilibrium condition. A parameter p (= 1 or 0) 
is introduced for analytical convenience: if storage occurs it is equal to 
unity; otherwise if we set p = 0 and a = 0 , we have the standard, no 
storage, market clearing model. 

Substitution of equations (23), (24) and (25) into (26) yields the 
reduced form: 


-flp t + pa(E t p t+ 1 - p t ) = jEt-m + pa{E t -ip t - p t - 1 ) + u t (27) 

Assume initially that u t is white noise. Solve this model under rational 
expectations using the Lucas method of undetermined coefficients. 

If we let 


p t = ap t -1 + bu t (28) 

for p = 1 we obtain by substitution 


b = 


1 

aa — (a + 0) 


and a = 1 + 0.5 


(P + l) 

a 


0.5 



(ff + 7) ' 2 

a 



(29) 


with 0 < a < 1 . 

The solution for prices (28) is of interest. Price exhibits serially cor¬ 
related fluctuations around its mean value when Ut, a variable which 
represents exogenous influences, is random. The reason for this is that 
inventories smooth out the effects of disturbances (shocks) to demand 
or supply. Consider, for instance, an abnormally good harvest due to 
favourable weather. I 11 a market without storage, the additional sup¬ 
ply will impact on market price in the current period. However with 
an inventory demand, speculators will buy some of the harvest, since 
the price in the future will, ceteris paribus, be greater than today, as 
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weather returns to its normal expected value. This procedure will gen¬ 
erally dampen price fluctuations. In addition the shock in the current 
period will have an impact in future periods which in this context is an¬ 
other way of saying that price movements will exhibit serial correlation. 

Equation (28) also leads to another insight, as pointed out by Mutli. 
As the importance of inventory speculative demands dominate a market 
relative to flow demands or supplies (as is likely over short periods of 
time) and consequently a becomes large relative to p or 7 then (29) 
implies that a will become close to one. Consequently, price will approx¬ 
imate a random walk. This is an empirical feature often noted in price 
series for storable commodities in high frequency data. We also note 
that in this model the rational expectation of price is a fixed multiple of 
last period’s price, though the coefficient is a function of the parameters 
of the model. In general, rational price expectations will be a function 
of lagged prices, though not in a mechanistic fixed fashion as occurs in 
an adaptive expectations model. Muth, however, did use a special case 
of the above model to illustrate how an adaptive expectations scheme 
could be rational. If we let p = 0 and a = 0 we obtain the reduced form 
of the market clearing model given by 


-0Pt = lEt-iPt + u t (30) 

Suppose that the error process is serially correlated so that 

u t = u t -1 + v t (31) 

where v t is white noise. 

Consequently the rational expectation of (30) is given by 

~{P + l)Et.-iPt = Ut -1 (32) 

If we lag (30) one period and substitute for ut-i in (32) we obtain 

-(/? + l)E t -\p t = ~/3p t - 1 - lEt-iPt-i (33) 

Adding (/? - 1 - i)E t - 2 pt-i to the left- and right-hand side of (33) we 
obtain after simplification the adaptive expectations scheme 

P 


E t -ip t - E t - 2 pt -1 = 

Notice from (30) and (32) that 


P + 1 


(Pt-l - E t -2Pt-l) 


n 7«t-i . (, lL 


u t 


Differencing (35) and substituing for Ut from (31) we obtain 


p t - p t -1 = ~P~ 


1 - 


lL 

P + 1 


v t 


(34) 


(35) 


(36) 
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so that as required prices follow an ARIMA(0, 1, 1) process. 

Finally it is interesting to consider the properties of a simple adaptive 
scheme when it is not rational. Consider the Wold decomposition for a 
variable 

OO 

Pt = Pt+ X niUt ( 37 ) 

i =0 

where p t is the mean component which could include a deterministic 
trend. 

For ease of exposition consider the simplest adaptive expectation, 
namely 

OO 

Pt = Pt- 1 = Pt- 1+ X ( 38 ) 

i =0 

The forecast error is given by 

OO OO 

Pt - Et-ipt = Pt+ X n i u t - (Pt- 1+ X 7r * n t-») (39) 

i—0 i= 0 

In the case where p is a constant so p t = p t _ 1; we observe that the 
adaptive forecast is unbiased, since the average value of the forecast error 
is zero. However the forecast is inefficient in general since the right-hand 
side of (39) being a function of past information implies that the forecast 
error is correlated with information known at the time expectations were 
formed. If p t contains a deterministic trend so that p t = a + bt, then p t — 
p t _i =6 so that from (39) the adaptive forecast will exhibit systematic 
bias and inefficiency. 
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Partial Information and Signal 
Extraction 


To this point, we have assumed that people had access to all relevant 
information on events up to a certain date in forming their expectations: 
for example E t x t or E(xt | $t) implied they know the whole contents 
of $ 4 , the information set relating to period t and before. If they did 
not have access to <f> t , then they might have access to the previous 
period’s information set. However, there is an intermediate possibility, 
which we have up to now ignored but which has important implications 
for the behaviour of people with rational expectations. This possibility 
is that they know a part of <f> t only, as well as all of $ 4 _i. For ex¬ 
ample, each individual might know the prices of certain goods which he 
or she trades (but not the general price level). This is ‘micro’ informa¬ 
tion. As for macro or aggregate information, in many economies capital 
market information, such as exchange rates or interest rates, is available 
essentially instantaneously. 

When endowed with such partial knowledge, agents face a statistical 
inference problem. Observation of the current values of macroeconomic 
variables, given knowledge of the variance of disturbances in the econ¬ 
omy, allows them to form an optimal expectation of the currently unob¬ 
served random variables using Kalman filter methods (Kalman, I960). 
In particular, if a variable z is observed which is the sum of two random 
variables (w, e) i.e.: 

z t = u t + e t (1) 


then the current expectations of u and e are respectively given by 

t2 


E t (u t ) = 


< 77 , 


On + 


-z t 


( 2 ) 


E t {et) 


;Z t 


( 3 ) 
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where <r^, a 2 are the known variances of the disturbances (Graybill, 
1961, has a fuller discussion). The coefficient on z t in (2) or (3) can be 
thought of as that of a simple ordinary least squares regression of u t on 
Zt (carried out over an infinite sample). 

The purpose of this chapter is to consider two examples of how partial 
information alters the solution of rational expectations models; these will 
both illustrate the method of solution and explain the workings of two 
models important in their own right. The first example is the Phillips 
curve in Lucas’ (1972b) ‘islands story’, where people know individual 
prices but not the general price level — a case of partial micro informa¬ 
tion. The second example is a general macro model where people have 
some capital market information — a case of partial macro information. 


THE NEW CLASSICAL PHILLIPS CURVE 

In the original formulation of the Phillips curve (Phillips, 1958; Lipsey, 
1960), as we saw in chapter 1, short-run rigidity of wages and prices and 
so market disequilibrium was assumed. The curve related the change in 
wages to a measure of excess demand at wages that were not clearing the 
market. This relationship was later ‘augmented’ by Phelps (1970) and 
Friedman (1968) with the addition of expected inflation: the idea was 
that people were bidding up wages at a rate expected to reduce excess 
demand but since excess demand would only fall if wages rose in real 
terms, nominal wages must be bid up by an amount equal to the expected 
necessary rise in real wages plus the expected rise in general prices. To 
the resulting wage equation was usually added a price equation relating 
prices to costs, from which a Phillips curve in price inflation could be 
obtained. 

The New Classical reinterpretation consists of five elements: 

1. The labour market is assumed to clear: there is no wage rigidity. 

2. People and firms observe prices in their own markets continuously 
but observe prices in other markets after a time lag. This im¬ 
plies that firms continuously observe the prices of their inputs and 
outputs, workers only their wages and some local prices (e.g. of 
groceries for which they continuously shop). Workers must form 
an expectation of the general price level. 

3. Workers have rational expectations and in particular use signal 
extraction to infer the current general price level from the local 
prices they observe. 
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4. Workers’ supply of labour has a substantial elasticity to current 
real wages; this is usually supported by the idea of intertemporal 
substitution of labour supply (although it could also occur when 
labour has a reservation wage for other reasons, such as unemploy¬ 
ment benefits or ‘shadow economy’ earnings). 

5. Firms continuously maximize profits; they relate prices to (marg¬ 
inal) cost, capital usually being taken as fixed, and they hire labour 
to the point at which its marginal value product equals its wage. 

These ideas were developed by Lucas in a series of papers (Lucas and 
Rapping, 1969, Lucas, 1972a, b, 1973). They can be seen as a response 
to the difficulties of integrating rational expectations into the original 
Phillips curve formulation: it is hard to see why rational workers and 
firms should permit rigidity of wages and prices. Later, as we shall see in 
chapter 4, Keynesian theorists attempted to overcome this difficulty by 
the assumption of nominal contracts of long maturity; that assumption 
is not easy to reconcile with the idea of voluntary contracting by agents 
free to exploit all opportunities for trade, and Lucas’ theory has the 
advantage of full consistency with that paradigm (one can also think of 
it as a model in which there are fully contingent wage contracts). 

Let us begin with workers’ intertemporal substitution. If one writes 
down a worker-consumer’s general utility function, including terms in 
both consumption and leisure now and in all future periods, the first- 
order conditions for a maximum will among other things set the marginal 
rate of transformation between present and future labour supply equal 
to the gross real rate of interest — for an application see Sargent (1979a, 
chapter 16). 

This is illustrated in Figure 3.1, where for simplicity the worker is 
assumed to have a fixed present value of consumption, c, which can be 
achieved in a two-period life by working either this period or next: his 
indifference curves between leisure (work) are tangent to the trade off 
between minimum current and future hours required to achieve c. c 
divided by next period’s expected real wages, uf +1; and multiplied by 
1 + r (the gross real interest rate) gives the number of hours needed to 
provide c by working entirely in t + 1 and borrowing against that income 
to consume in t. The tangency condition is: 

U't+i- w t = 1 

U[.w e t+1 1 +r 

where U' is the marginal utility of leisure. 

Hence for a normally behaved utility function, higher current real 
wages will have an income effect if ‘permanent’ real wages rise as well, 
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diminishing work effort, and a substitution effect (relative to future real 
wages), raising it. Lucas’ argument is that these movements in current 
real wages generally leave permanent real wages unchanged and so the 
substitution effect is dominant, and large enough to account for the em¬ 
pirically observed Phillips curve correlation between prices and output. 



Figure 3.1: Intertemporal substitution by workers 

Firms’ profit maximization leads, as discussed in chapter 1, to a 
downward-sloping demand for labour from the first-order condition that 
the marginal product of labour, /', equals the real wage, or /' = W/P , 
which in the simplest case where labour is the only variable factor also 
expresses the price = marginal cost condition, P = jr ; if / is a normally 
behaved production function, /'will fall as labour input increases. 

The complete Phillips curve derivation is illustrated in figure 3.2, a 
four quadrant diagram due to Parkin and Bade (1988), first introduced 
in chapter 1 as Fig.1.1. Quadrant (a) shows the labour ( L ) market: the 
supply by workers is conditional on expected (log) prices, p e (as well as 
permanent real wages, w*, and r, both of which we hold constant here), 
the demand by firms depends on their own actual prices, which they con¬ 
tinuously observe. Quadrant (b) shows the short run production function 
relating output (Y = exp y) to labour, capital being fixed. Quadrant (c) 
transfers the implied output to quadrant (d), which summarizes the re¬ 
sulting PP relationship between (log) prices, p , and output. This is 
illustrated for an increase of p from po = p e to p \, with workers contin¬ 
uing to expect prices of p e = Po- This raises labour demand to D'D' 
but leaves the supply curve where it is. Employment expands and more 
output is produced: the rise in prices has raised wages and so ‘fooled’ 
workers into thinking they are being paid higher real wages, so supplying 
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more labour to firms who are actually enjoying lower real wage costs. 



Figure 3.2: The New Classical Phillips curve 

Suppose pi is maintained in the next period and p e rises to this 
new level, then SS too shifts upwards. It will do so by exactly the 
same (vertical) distance as DD , namely a rise in W that is of the same 
proportion as the rise in p; this leaves both actual and expected real 
wages the same as at our starting point, and hence labour supply and 
demand will be the same as then. The resulting relationship between 
p and y, the ‘long-run’ Phillips curve obtained when p = p e , is shown 
as P*P* and is vertical: consistently with neo-classical theory in which 
people and prices care about real quantities and relative prices, having 
no ‘money illusion’, rises in prices when fully anticipated or perceived 
have no real impact. In this simple model with no lags or adjustment 
in production or labour supply the economy is at its natural rate when 
p = p e . 

Let us write the resulting Phillips curve relationship between y and 
p ( the log of respectively output and prices) as: 

yt -y* = ^(pt -p e t) ( 4 ) 

This immediately brings out the formal equivalence between the tra- 
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ditional Phillips curve we used in chapter 2, in which prices are the 
dependent variable, and the New Classical formulation here in which 
output is. Although the derivations are clearly different, the result is 
formally the same. 

Now let us graft on to this process the one assumption we have so 
far left out, signal extraction. Workers can use their current information 
on local prices they observe in their regular shopping to improve their 
expectation of the general price level. Denote the ith (of N) group of 
workers’ (log) local price as p tt and assume that it is governed by 

Pit = v it + p t (5) 

where v lt is a random error with variance of a 2 . 

The problem faced by the ith group of workers is to forecast pt — 
E(p t | from pa — E(p lt | <b t _i) where <!> t ~i is all last period’s 

data assumed available to all i groups by the current period. To do 
this, they compute the least squares regression of p t — E t (p t | 3> t _i) on 
Pit — E(p it | <f> t _i) and predict from it. 

We know that 

OO 

Pt = Xjct-i+p* ( 6 ) 

i =1 

oo 

E(p t | +P* (7) 

i =o 

and 


E(pu | d> t _i) = E[(p t + v it ) I $t-i] = E(p t | $ t _!) (8) 

since all i groups have the same t — 1 information and cannot predict Va 
from it. So our regression will be: 

7T0 e t =<t> o + et + Vit) (9) 

Assuming our workers had a large sample, so that we can ignore 
sampling error for simplicity, they will obtain (all i groups since a\ is 
the same for all Vu) </> 0 = 0 because e t and vu have zero means, and 

_ E(Tr 0 e t ,Vit) _ tt'qU 2 
E{n 0 e t ,v it ) 2 7 t 2 ct 2 -P <j 2 

Armed with this regression, the ith group’s current expectation of 
p t , Pt^ = E(p t | On, •f’t-i) where da is this period’s data available in 
the current period to the ith worker, now can be written as: 

Pt,i = E (Pt I $t-i) + HPit - E[p t | $t-i]) 


( 10 ) 
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Averaging expectations over all i groups gives us: 

'y . 

Pt = = (1 - 4>)E(p t | $ t -i) + 4>Pt (11) 

In passing, notice that p% implies an average expectation of e*. Since 
all i groups know that p t = 7Toe t + E(p t | <f> t -i), their expectation of e t 
will be given by: 


E(e t I 

7T0 

_ Pt,i - E (Pt I *t-i) _ <t>{ Pit - Ejpt | jt-i) (12) 

7T0 7T0 

Averaging (12) over all i gives: 


e 


e 

t 


(fro* , 

-= fat 

ttq 


(13) 


We may now integrate p ® into our New Classical Phillips curve: 


Vt - V* = ^(Pt - Pt) = ( ^-j^-[Pt - E{p t | $t_i)] (14) 

The dashed curve in figure 3.2 shows the resulting full New Classical 
Phillips curve: signal extraction steepens its slope, since now as actual 
prices rise workers react by altering p f. 

Let us work out the values of <j> and 7To for a simple model: 


m + e t =pt+yt (15) 

y t-y* = ^ s ^ [Pt - E(p t | $t-i)] (16) 

Using the Muth solution we obtain: m = 0 (i > 1) and 7To = 6/(6 + 
1 — (f)). The slope of the Phillips curve is and must lie between 0 
and A, since 0 < (/> < 1. 

With </> as given above, we can see that the solution for no and (f> is 
in general a cubic, with the extreme values of 7To = 1, (j> = 1 (vertical 
Phillips curve) and no = prj, <f) = 0 (the PP curve in Figure 3.2). 

The main implication of signal extraction is therefore that the Phillips 
curve’s slope depends on the behaviour of monetary policy. This de¬ 
termines 7Toe t in our simple models to this point; but even in more 
complex models where many influences impact on the ‘surprise’ term 
(pt — E(p t | $t_i)), including all demand and supply shocks, money 
supply shocks will still play a major role (a more complex model of this 
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sort in which the problem is to disentangle permanent from temporary 
money supply shocks is set out by Brunner et al. (1980). If the money 
supply is volatile, then workers will assume that the current movements 
in prices they observe are largely monetary in origin: (f> will be close to 
1 as 7 Tq of is large relative to a 2 v . The Phillips curve will be close to 
vertical. By contrast, countries with monetary stability will have flatter 
Phillips curves, with workers interpreting local price movements as pre¬ 
dominantly relative price movements. Lucas (1973) turned up convincing 
evidence of this in a large cross-country sample of Phillips curves; and 
much subsequent evidence has confirmed it. One particularly obvious 
way in which the Phillips curve steepens is through the spread of index¬ 
ation in countries with poor monetary control: indexation for a group 
of workers typically replaces p% by a weighted average of pfand p t (or 
Pt~k where k is made as small as possible given information-gathering 
costs). Ironically, though, in countries with extreme monetary volatility 
indexation is handicapped both by the necessary lag in information and 
by poor or even fraudulent government information about the general 
price level. 


THE SUPPLY SIDE: INTEGRATING MACRO 
WITH MICRO 

Most of this book is concerned with the behaviour of the economy in 
response to monetary shocks, in the sense of shocks which disturb the 
absolute price level. These set up reactions through the Phillips or sup¬ 
ply curve on output and so on other variables, for example in the way we 
have just examined. Yet there is a wide variety and scale of real shocks, 
which, regardless of their effects on the price level, cause important ef¬ 
fects on the economy. One of the key changes in our thinking produced 
by the rational expectations hypothesis has been a renewed emphasis on 
the ‘supply side’; that is, the mechanisms through which the economy 
responds to real shocks. 

One branch of rational expectations research, real business cycle the¬ 
ory, dismisses monetary shocks altogether as a source of variation in 
output, explaining it entirely in terms of real shocks (e.g. Kydland 
and Prescott, 1982; Long and Plosser, 1983); these economists argue 
that people have sufficiently up-to-date information on the price level to 
avoid being fooled as in the New Classical supply curve and that any 
contracts they sign are fully indexed to the price level, again avoiding 
any real effects of unexpected price changes as in the New Keynesian 
supply curve. The price level on this view will vary if the money supply 
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increases (e.g. through expansion in bank credit and deposits) but this 
will have no real effects. Only if there is some shock to payments tech¬ 
nology which disturbs real plans (for example, by credit controls creating 
a ‘credit crunch’) will monetary shocks have a real effect: but this is not 
a normal money supply shock in our terms. 

The real business cycle school may or may not be going too far in 
denying any effect of monetary shocks: testing its assertions is diffi¬ 
cult because it can in principle account for the same correlations we 
observe, such as that between prices and output in the Phillips curve, 
by appealing to reverse causation — real shocks move output, which 
induces monetary expansion, which raises prices. Nevertheless, what is 
undoubtedly important is the focus on real shocks and the supply side 
as of primary interest to macroeconomists. We have just set out a basic 
model of the supply side, to explain the full New Classical model. This 
model can be developed to explain unemployment in terms of people’s 
voluntary choices confronted with the opportunities they face (not nec¬ 
essarily attractive ones of course); a further factor which may frustrate 
their choices however is the power of unions. Analysis of unemployment 
along these lines for the UK is to be found in Minford et al. (1983) and 
Layard and Nickell (1985), for Germany in Davis and Minford (1986) 
and for a variety of other countries in Bean et al. (1986). In chapter 9 
we discuss these supply-side issues further; and in chapter 11 we set out 
a full real business cycle model. 

Rational expectations has re-united macro- and microeconomics into 
a single subject. Keynes (1936) divided off, indeed created, the subject 
of macroeconomics with its own aggregate laws, not derivable from micro 
behaviour and subject to regular aggregate ‘market failure’. Since his 
intervention we have learned much about aggregate behaviour, which 
had never previously been much studied by the classical economists. 
Essentially, rational expectations has enabled us to account for macro 
behaviour in terms of micro laws. 


CAPITAL MARKETS AND PARTIAL MACRO INFOR¬ 
MATION 

We now consider our second example of signal extraction. Here we as¬ 
sume that there is no useful local information (a^ is large relative to 
<7g) but that there is current macro information from capital markets. 
Clearly a relevant model of most economies will contain both sorts of 
information but it helps our exposition to focus on each separately. 

Our illustrative example supposes that people know the interest rate 
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currently. They wish to derive from this estimates of other current 
macroeconomic variables — the price level and output, and so on. They 
do this just as in the local information case by prediction from a re¬ 
gression of these variables on the interest rate, any variable x t being ex¬ 
pressed as x t — E(xt | $t_i). Again, as in the local case, the regression 
parameters enter the model through their effect on the expected vari¬ 
ables: there is an additional feedback in the model from current events 
to expectations, altering the impact effect of shocks to the economy. 

This is a much more complicated signal extraction problem than the 
earlier one we considered in the Lucas supply curve where the general 
price level had to be extracted from the local price. There the only in¬ 
formation people have is the local price and the equation relating this to 
the general price (5). Here they have information on a ‘global’ (economy¬ 
wide) variable, interest rates, and all the equations of the global (macro) 
model. Hence any view they form of the shocks driving the global inter¬ 
est rate must be consistent with what the model would produce; it must 
also be the case that the actual shocks (which will in general differ from 
what they expect) must via the model produce this same interest rate. 
This is a highly complicated consistency condition. 

To get an intuitive understanding of what is happening, it is useful to 
consider diagrammatically how the economy behaves when this sort of 
signal extraction is going on; and to compare it with the usual situation 
we have discussed up to now where no global signal extraction is going 
on. 

It is of some interest to compare the reaction of this ‘economy’, based 
on the model of equations (17)-(20) below, when R t is known with that 
when R t is not known. Table 3.1 shows the reactions of output to the 
three shocks as derived below from that model. When R t is not known, 
all shocks have ‘normal’ positive effects on output. However, the sizes 
of the coefficients are quite different when R t is known. This is hardly 
surprising since now output responds both to expected shocks and to 
the difference of shocks from their expected levels. Now even the sign of 
effect can be different for the various shocks. We can understand this as 
follows. 



u t 

v t 

e t 

y t ( R t not known) 

W 


aW 

y t (R t known) 

0U + T& 

<t>e ~ a <Pu 

a ^ + TT7 

where W = 1+a ^ +m 





Source: Model equations (IT)—(20) 


Table 3.1: Output Reactions to Shocks 
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Suppose the noise in e t dominates; then <j> e —► 1, (</> u , <j> v ) —► 0 
and u t has no effect because E t u t —» 0, E t v t —» 0. Suppose noise in v t 
dominates; then (</> e , 0 U ) — » 0 and v t has no effect. But suppose noise 
in u t dominates; then E t Vt —> 0, — » 1, so that a ( Vt +^ tUt '> — _ G . 

Hence a demand shock has a negative effect on output if supply shocks 
predominate, because agents misinterpret the effect of the positive de¬ 
mand shock on interest rates as that of a negative supply shock; expected 
prices consequently rise more than actual prices and supply of output is 
reduced. 

Similar ‘peculiarities’ can occur in the reactions of p t and f? t ; Table 
3.2 documents them. It is worth stressing therefore that the economy’s 
behaviour in response to shocks can be ‘paradoxical’ if the shocks are 
‘misinterpreted’. Such effects are well known at the level of everyday 
comment (cf. the behaviour of the UK economy in 1980, when the interest 
rates were interpreted as responding to ‘overshooting’ of its target by the 
money supply; subsequently it turned out that the money supply, truly 
measured, had contracted substantially). It is of interest that they can 
be rationalized within a stylized framework. 

One such case is illustrated in figures 3.3 and 3.4. In figure 3.3, it is 
assumed that a negative monetary shock, e t , occurs but that R t is not 
currently known; so normal results are obtained. The LM curve shifts 
leftwards in the upper (R t ,y t ) half, shifting left the aggregate demand 
curve in the lower (p t , y t ) half. Prices and output fall and the interest 
rate rises; expectations of course are undisturbed. 


u t v t e t 


Pt(Rt not known) 

-(l + f)IU 

w 

Pt(Rt. known) 

-s 

(1 + a)S — 1 1 — aS 

Rt{Rt not known) 

-<LW 

(l + f)W/a -S.W 

Rt{Rt known) 

A 

B D 

C — a+c x 

| c+a(l+c) 

■6 + c A-u) x 


Source: Model equations (17)—(20) 


Table 3.2: Price and Interest Rate Reaction to Shocks 


In figure 3.4 the same shock occurs when R t is currently observed (so 
that E t p t now reacts); we illustrate the paradoxical case just discussed 
where people expect only variations in the supply shock, u t , to be of any 
significance, so they misinterpret the shock as a negative supply shock. 
The left side of figure 3.4 shows actual outcomes, the right side shows 
expected ones: across the two sides expected outcomes are of course the 
same. The expected outcome is a rise in the interest rate and in prices, 
together with a fall in output because of a negative u*. The rise in E t pt 
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Figure 3.3: The case where Rt is not currently observed and a negative et 
shock occurs 


shifts both the expected IS and LM curves, IS e and LM e , along the 
ISLM e curve which plots their intersection as Ep t changes. In ( E t p t , 
E t y t ) space, the AS curve shifts leftwards from the fall in E t u t , along 
A e D e , the expected aggregate demand curve. 

The actual outcome can be broken down into two parts: the shifts in 
the curves because E t p t rises, and the shifts because e* falls. The rise in 
E t p t shifts the AS 1 curve up and the AD curve leftwards in (p, y) space, 
the latter being produced by the IS curve shifting leftwards in (R, y) 
space. The fall in e* shifts the LM leftwards in ( R , y) space and so 
the AD leftwards in (p, y) space. These various shifts are shown on the 
graphs by labelling each curve according to the value of et, p t and Ep t . 

In effect, the expected supply shock drives expected prices up (be¬ 
cause of fears of shortage) so that even though prices may fall by much 
the same as in the normal case, the contraction in output is much greater. 
In this example, the misinterpretation of the monetary shock has se- 
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A negative e, shock occurs, but a 2 v , a 2 e are small so that a u, shock is expected (cj> u ->1). 
The £ t shock is interpreted as a 4 shock (E t p,+i = p*, i.e. p = 0) 


Figure 3.4: The case where Rt is currently observed 


riously worsened recession (somewhat reminiscent of the 1980 British 
recession). 

We have seen how the economy behaves. But how in detail does 
one extract a signal about the shocks in this economy? First, we need 
to know the variance of the different shocks (we assume for simplicity 
that they are unrelated; if not, we also need their covariances): this tells 
us the unconditional probability distribution of each shock. Second, we 
need to know the effect each shock would have — via the model — on 
interest rates: with this extra knowledge we can extract the probabil¬ 
ity distribution of each shock conditional on the observed value of the 
interest rate, and so the conditional expected value of the shock. 

Plainly, as we have seen in the diagrammatic example, the expected 
values of the various shocks influence the behaviour of the economy: in 
our example the shock expected was overwhelmingly the supply shock 
and this caused the economy’s agents to expect a high price level when 
hit by a demand shock. Hence the effect of the shocks on the economy 
depend on the signal extraction people make; and so the effects and the 
signal extraction formula are set simultaneously by the structure of the 
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model and the variances of the shocks. We now show one method for 
solving out what they are within the sort of model we have been using 
hitherto: this method has the benefit of showing transparently the in¬ 
terdependence between signal extraction, the model structure and the 
shock variances. In practical applications, however, we would recom¬ 
mend the use of a computer algorithm (developed in Matthews et al., 
1994a, b). 

To give our example enough complexity to be of interest we move 
to a fairly general macro model. We express all variables in deviations 
from equilibrium; so all constants such as to and y* are dropped. For 
short, E t xt is used to represent the current expectation conditional on 
last period’s full data and this period’s partial data (consisting of Rt, 


the interest rate). The model is: 

Vt = a{p t - E t pt ) + u t (17) 

m t =Pt+y t - cRt (18) 

m t = £im t _ i + e t (19) 

Vt — —oiRt + aE t p t+ 1 — E t p t + i>t (20) 


Ut, Vt and et are independent random shocks with known variances <j\, 
a 2 and a 2 -, (17) is aggregate output supply; (18) is money demand; (19) 
is money supply; (20) is demand for aggregate output. 

To solve this model, we first obtain a basic equation in the errors as 
deviations from their expected values. Taking expectations of (17) and 
subtracting the result from (17) yields: 

yt ~ Ey t = a(p t - E t pt) + u t - E t u t (21) 

Equating (18) with (19) and following the same procedure as with 
(17) yields: 

Vt — E t et = Pt ~ E t pt +yt~ E t yt — c(R t — E t R t ) (22) 

Equation (20) analogously gives: 

Vt - Ey t = -{Rt- ER t ) + v t - E t v t = v t - E t v t (23) 

since R t = E t R t , Rt being known. Simple manipulation then yields our 
basic equation in the errors: 

(1 + a)(vt — E t v t ) = a(et — Etet) + Ut — EtUt (24) 

This formal restriction across the expected and actual values of the 
errors results from the fact that people know Rt. Therefore the actual 
solution of R. t (based on the actual and expected errors) must coincide 
with the expected solution (based only on the expected errors). The 
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parameters a and c drop out because when deviations from expectations 
are taken these multiply variables that must be zero (since R t = E t Rt , 
E t (E t P t ) = E t P t , E t (E t P t+1 ) = E t P t+1 ). 

This linear model has a general linear solution for R t in terms of the 
current shocks and lagged information: 

Rt = Au t + Bvt + De t . + Zmt—i (25) 

A, B , D and Z are the coefficients to be solved for. Consequently, 
using the Graybill (1961) formula, the best estimates of u t , v t , e t , given 
R t — Zm t -i are: 

Et.Ut = —<p u (Aut + Bi>t + Det ) (26) 

EtVt = —(p v (Au t + Bt>t + Det) (27) 


where 


and 


Et&t — -jj^eiAiit + Bvt + Det ) 




x ’ 




B 2 al 

X ’ 


X 


(28) 


X = dV + B z cri + T> J cr 


2 

e 


Substituting (26), (27) and (28) into (24) we obtain: 

(1 + a)v t - ae t - u t = {(1 + o)^ - ^ - ^}(Au t + Bv t + T>e t ) 


(29) 


Since v t , e t and u t may each be any real number, (3.29) is only satis¬ 
fied, in the usual manner of undetermined coefficients, if the coefficients 
on each alone add up to zero, that is if for example 


1 = 


aB 


B 


(1 + a)D e (1 +a)A 


<t> u (on v t ) 


(30) 


Further, since we know that <j> v + </> e + <t> u = 1 it follows at once 
similarly that 


D D a A 
A =a 'B = ~1 +a’B 


1 


1 “h Cl 


( 31 ) 
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Consequently 


where 


_ ai. x _ « 2cr e 

Pu ’ < P y x' ' x' 


X' = a 2 u + (1 + a) 2 a 2 + a 2 a 2 


The full solution of this model can now be found. From (17), (18) 
and (19) we have: 

Pt + a{Pt - E t p t ) +u t - cR t =e t + pm t - i (33) 

from which we obtain, taking expectations and using R t = E t Rt'- 

Rt = ^( E t p t + E t u t - E t e t - pm t - 1 ) (34) 

Equating (17) and (20), taking expectations and substituting for R t 
from (34) we get: 


EfU t - E t v t = — — (E t pt + E t u t - E t e t - pm t -i) - aE t p t + aE t p t +i 
c 

and rearranging: 


EtPt+i 


OL ~\~ C 1 1 

E t Pt = - : E t u t - E t v t - E t e t - p/cm t -1 

etc a c 


Following Sargent’s procedure (see chapter 2), we write the left hand 
side of (35) as 

[l-T^- B-AEtPt 

C l 1 + c 

where B is the backward operator instructing one to lag variables but 
not the expectations date (e.g. BE t Pt = E t P t ~ i). Equation (35) can 
now be written as: 


E t pt = 


i +c) 

( ( —^—Eu t - -Ev t - -Ee t - \ = 

\ ac a c c J 

OL C C 1 

-FT7— \E t Ut H- jt— — -rE t vt + —— E t e t 

a(l + c) a(l + c) 1 + c 


1 + c -f-' 


Etrrit-i+i (36) 
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Since E t m t -\+i = m t - 1 , pm t -1 + E t e t , p 2 m t -1 + pE t e t , ... for i = 
0,1,2,... (36) becomes: 

EtPt = + 


a(l + c) 


a(l + c) “ l + c(l^)' 


+ 


M 


1 + c(l — fl) 


whence: 


Etpt+i = 


_ p 2 m t -1 + pE t e t 


1 + c(l — //) 

To find substitute from (37) into (34) to obtain 

Rt = ~~ E t u t + -J—E t v t - E t e t - 

a(l + c) a(l + c) 1 Cyl — fi) 

M(l-M) 

1 + c(l — /r) 


?n t _i (37) 


(38) 


We can write 


E t ut = 4> u QuE t vt = -r^Q t ;E t e t = ^Q t 


in t -1 ( 39 ) 


(40) 


where 

Hence 
A = 


Qt = ut - (1 + a)v t + ae t 

1 — ot M 1 x 1 - /i 

'a(l + c)^“ “ a(l + c)(l + a)^ “ a[l + c(l -/*)]' 
B = —(1 + a)H; 


D = aA (41) 

What we see therefore in these solutions is a striking complexity in 
the contemporaneous response of the economy to shocks. That com¬ 
plexity is due to the fact that the shocks go through an ‘interpretation 
filter’ — signal extraction — which modifies their ‘direct’ effect, that is 
their effect in a world where people have no global information. Thus for 
as long as people do not have full information, the course of the econ¬ 
omy is contemporaneously influenced by what they think are the shocks 
driving it. This influence persists after they have discovered what the 
shocks were, because of the lagged effects in the model: however of course 
this persistence also applies to the direct effects of the shocks. We can 
therefore describe global signal extraction as a further contemporaneous 
transmission mechanism of shocks, over and above their direct transmis¬ 
sion mechanism. 
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CONCLUSIONS 

We have seen that signal extraction has potentially important effects 
on economic behaviour. How people interpret shocks conditions their 
behaviour whether in the slope of the Phillips curve or more generally. 
In this chapter we have illustrated these effects and shown how models 
can be solved allowing for the signal extraction feedback. In subsequent 
chapters we shall be applying the techniques of chapters 2 and 3 to the 
analysis of a variety of policy issues. 
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Stabilization Policy — Can we? 


In this chapter we will outline the implications for stabilization policy 
if people form their expectations rationally. Stabilization policies are 
typically defined as policies aimed at reducing (usually the variance of) 
output or employment deviations from their full-employment (‘natural’ 
or ‘equilibrium’) levels. This is the definition we will use primarily, 
though we will occasionally widen it to policies designed to stabilise a 
wider menu of macro variables (such as inflation). 

We will consider government use both of the budget (tax rates and 
spending- ‘fiscal policy’) and of the money supply (‘monetary policy’). 
We will treat a monetary policy of interest rate setting as equivalent to 
some money supply policy (rather like in trade theory import quotas are 
treated as equivalent to tariffs which vary with market conditions). 

As this point is important (given that central banks usually do use 
rules for setting interest rates) and not immediately obvious, let us con¬ 
sider two examples of this. In the first case, suppose the interest rate 
behaviour of the government or central bank (henceforth ‘the govern¬ 
ment’) is 

Rt =a(y t -i -y*)+/3(m t -m*)+ri t (la) 

Then we can rewrite it as a money supply rule: 

m t = m* + j^R t - jjiVt-i - y *) - (lb) 

In the second case, assume that the interest rate behaviour does not 
directly include the money supply. Here we have to introduce other rela¬ 
tionships in the model to determine the implied money supply behaviour. 
So let 


Rt = a(y t -i - y*) +y t (lc) 

and suppose the demand for money is simply mt = —f3Rt 
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Then the implied money supply rule is 

m t = —a/3(y t -i - y*) - f3t] t (Id) 

In other words we can rewrite any interest rate behaviour as an im¬ 
plied behaviour of the money supply, and regard the government as 
obeying that ‘money supply rule’. 

The vexed question now arises of what government policies are ‘rules’, 
which ‘discretionary’, and which ‘fixed rules’. However from the view¬ 
point of rational expectations we can distinguish between that part of 
government behaviour which is a surprise (cannot be predicted from 
known past events) and that part which is a predictable response to past 
events; furthermore within the surprise element we will distinguish be¬ 
tween that part which responds predictably to contemporaneous events 
and that part which is contributed by the government’s own unpre¬ 
dictability. The description of the predictable part of the behaviour 
is the ‘rule’ governing its behaviour; the rest is the ‘surprise’ element in 
policy. Within the rule element, the government could respond to past 
events actively or it could refuse to respond at all; the former is often 
called a ‘feedback’ response, the latter a ‘fixed target’ rule. Perhaps the 
best-known example of this last is Milton Friedman’s proposal that the 
government should adopt a policy of a fixed growth rate of the money 
supply (Friedman, 1968). 

It is possible for feedback rules to be simple or complex, depending 
on exactly how governments set about stabilization. In the 1970s for 
example it was quite usual for governments to make detailed forecasts 
every so often of what would happen with different settings of policy 
instruments; and then to adjust the settings to obtain the most desirable 
forecast — ‘optimal control policy’. This is sometimes referred to as 
‘discretion’ or ‘fine-tuning’. 

The relationship between past events and policy reaction resulting 
from this process would be complex; but it could in principle be written 
down. More recently governments have lost faith in such forecasting 
procedures (for reasons we discuss in chapter 6) and have used simpler 
rules of policy reaction — to one or at most a few lagged variables. It 
is rules like these we shall focus on as typical of feedback response or 
‘flexible rules’. (Notice in passing that flexibility cannot, as some might 
think casually, consist in a government changing its mind every period 
on how it will react to past events; if it did, it would have entirely 
unpredictable behaviour — policy would be a pure surprise.) 

We will also examine the government response to contemporaneous 
events. It is usual to suppose that governments can observe these no 
better than any private person (we will also consider the case where it 
has superior information below). However, there are ways policy can 
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respond to events without the government directly observing them: we 
describe these as ‘automatic’ responses. For example taxes (and ben¬ 
efits) respond to incomes through the tax/benefit system, without the 
government doing anything (i.e. changing tax or benefit rates). Simi¬ 
larly, the money supply (or interest rates) may respond automatically 
and simultaneously to income under the particular rule in force, without 
a contemporaneous decision by the government. These policy responses 
may well be important stabilizers (or indeed destabilizers). 


MODELS WITH THE SARGENT WALLACE SUPPLY 
CURVE 

An early result, due to Sargent and Wallace (1975), is that stabilization 
policy has no impact on either real output or unemployment in classical 
equilibrium models if they embody a supply function relating deviations 
of output to surprise movements in the price level, and further that both 
private and public agents (a) have identical information sets and (b) are 
able to act on these information sets. We discussed this result briefly 
in chapter 2 in the context of a simple monetary model, which we now 
extend somewhat. 

Consider the following simple model: 

Vt — —a(Rt — Et-iPt.+i + Ef-ipt) + — y*) (2) 

Vt = V* + flipt - Et-ipt) (3) 

m t =p t +yt- cRt + v t (4) 

m t = m + -y*) +u t (5) 

a, (3, Pf, p rn , and c are constants (pj, p m would typically be negative in 
Keynesian policy rules), Ut and Vt are random errors, Rt is the nominal 
interest rate. 

Equation (2) is the aggregate demand schedule. It includes a fiscal 
feedback response Pf{y t ~\—y*) representing government counter-cyclical 
variations in spending or tax rates. 

Equation (3) is the Sargent and Wallace Phillips (or supply) curve. 
This is derived as in chapter 3. The only difference is that it assumes 
that people can obtain no useful information about the general price level 
from their current local prices; so there is no signal extraction in this 
case — hence the dating of expectations at t — l. The same assumption 
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is used throughout the model. It turns out, as we shall see later, to be 
an important restriction. 

Another way of looking at (3) is as an ‘old’ Keynesian expectations- 
augmented Phillips curve in which inflation equals expected inflation 
plus an effect of excess demand proxied by y — y*: we can rewrite it as 
7r t - E t _ i7r t [= p t - Pt-i - {Et-iPt ~ Pt-i)} = jj(Vt - V*)- Sargent and 
Wallace stressed that their result could be viewed as an implication of 
orthodox Keynesian models, if rational expectations were substituted for 
adaptive expectations. 

Equation (5) is a money supply rule with a feedback response 

y m (yt ~i -y*) 

On substituting (2) for R t into (4) and equating (5) to the result, we 
obtain: 

Q 

rn + p(y t - 1 - y*) + w t = Pt + (1 + —)y t - c{E t -ip t +i - E t -\p t ) (6) 

a 

where wt = ut — v t and p = [M/(a) + Mm]- Substitution of (3) into (6) 
for y t and y t - 1 yields : 

Q 

m + f}p(p t -i - E t -2Pt-i) +Wt=pt + ( 1 + -)l3(pt - Et-iPt) 

a 

— c(E t -ipt+i — Et-iPt) + (1 + — )y* (7) 

a 

To solve (7) for prices, we use the Muth solution method discussed 
in chapter 2, writing: 

OO 

Pt = P+ ^2 ( 8 ) 

»=o 

We find that the identities yield: 

P = m- (1+ -)y* (9) 

a 

(terms in w t ) 1 = 7To[l + /3( 1 H-)] (10) 

a 

The identities in the other errors are irrelevant for our purposes here. 
Since 


Pt - E^m = 7r 0 w t (11) 

substitution in (3) yields 


Vt = y* + (3n 0 w t 


(12) 
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From (10) we see that 7To does not depend on either /i, m or and 
consequently we see from (12) that systematic monetary policy does not 
influence the variance of output in this model. Unanticipated monetary 
change is of course equal to rri t — E t _im t . 

Since = m + ym(y t - 1 - y*), 

m t - E t -init = in (13) 

which is a component of w t . Consequently, unanticipated monetary pol¬ 
icy does influence output in the Sargent-Wallace model but not antici¬ 
pated monetary policy. 

This result stems from the nature of the supply curve. Output is set 
by supply considerations (relative prices, technology, producers’ prefer¬ 
ences, etc.) and is only influenced by macroeconomic events if these 
cause surprise movements in absolute prices which in turn are partially 
(mis-) interpreted as relative price movements. Government by defini¬ 
tion cannot plan surprises (if it tried to, the ‘surprise’ would be — under 
our assumptions here — part of available information at i — 1 and so 
would be fully anticipated, and no surprise at all); its feedback responses 
are planned variations in net spending or money supply. 

A basic extension of the result occurs if there are adjustment costs 
in supply; allowance for these in a standard way (e.g. a quadratic cost 
function) adds a term +j(y t - 1 — y*) to (3) (0 < j > 1). A shock to 
output now persists, and in principle the business cycle in output can 
be accounted for by the interaction of a variety of shocks with such 
a ‘persistence mechanism’ (various forms of it have been suggested by 
Lucas, 1975; Sargent, 1976; Fischer, 1980b; Barro, 1980). 

Even though a macroeconomic shock now affects output for the in¬ 
definite future, it is still impossible for fiscal or monetary feedback rules 
to affect its variance because they can neither affect the impact of the 
shock itself, being a surprise, nor alter the adjustment parameter which 
determines the lagged effects, this parameter being fixed by technology, 
etc. We leave the demonstration of this — by substituting for y t and 
y t - 1 in (7) from the new supply curve in (3) — as an exercise for the 
reader. 


DIFFERENT INFORMATION SETS 

It is crucial for this neutrality proposition that, even in a model embody¬ 
ing a Sargent-Wallace supply curve, both private and public agents have 
the same information set. 

If, for example, the government has an information superiority, then 
it can use this to modify the ‘surprise’ faced by the private sector. For 
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suppose private agents have access only to last period’s data in the cur¬ 
rent period, but the government knows the true price level (assume it 
collects price statistics over the period and waits before releasing them). 
Then it may in principle let its net spending or the money supply react 
to this information; its reactions will modify the price surprises to sup¬ 
pliers. Formally, add —af(p t — E t _\p t ) into (2) and —a m (p t — E t _\p t ) 
into (5) where a/, a m (both positive) are fiscal and monetary responses 
respectively. To simplify matters set pj = p rn = 0. Equation (7) now 
becomes 


CCL f C 

- L )(p t - Et-xp^+wt = p t + (1 + ~)(pt ~ E t -xp t ) 

a a 

- c(E t _ 1 p t+ 1 - E t _xp t ) + (1 + -)y* (14) 

a 


so that from the terms in w t we have: 




1 

1 + (1 + a)0 + a m + 


(15) 


from which it is apparent that the higher a m , a,f the smaller the price 
surprise and hence the output variance. 

One may ask, however, why a government in possession of macro 
information should not release it rapidly as an alternative to implement¬ 
ing such (presumably costly) rules. If it did so, private agents would be 
able to make better informed judgements about current macroeconomic 
events, increasing the economy’s stability. In the example here, if price 
data are released rapidly, then p t will be effectively known in period t 
and the economy will be in continuous equilibrium — perfect stability! 

A further information asymmetry, which may violate neutrality and 
has had some attention (Turnovsky, 1980; Weiss, 1980), is that where 
one group of private agents has superior information to that possessed 
by suppliers and by the government. To illustrate this possibility, modify 
the aggregate demand schedule (2) in the above model to 


yt = ~a(R t - E t pt + i + p t ) 


(16) 


The interpretation of this aggregate demand schedule (16) is that 
investors have instantaneous access to current information on all relevant 
macro data while other agents, such as the government or suppliers of 
goods receive this information with a one-period lag. 

In defence of this idea, it is argued that agents in regular contact 
with asset markets receive global information (such as interest rates and 
asset prices) almost instantaneously, by contrast with those in the labour- 
market. 
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Substitution of (16) into our model in place of (2) yields the following 
reduced form: 

Q 

m + nPipt -i ~ E t - 2 Pt-i ) + u>t = (1 + c)pt + (1 -\—){pt — E t _ip t ) 

a 



— cEtPt+i + (1 H — )y* 
a 

(17) 

where p, = (/ j, m +Pfc/a ) 
are given by: 

as before. Using the Muth solution the identities 


p = m-{ 1+ -)y* 
a 

(18) 

(terms in w t ) 



1 = 

- 7To(l + C + /3(1 + —) — C7T 1 

a 

(19) 

(terms in w t ~ 1 ) 

p,/3no = 7Tl(l + c) — C7T 2 

(20) 

(terms in w t -i, i > 2) 

0 = 7Tj(l + c) — C7T i+ i 

(21) 


Equation (21) defines an unstable process. Consequently applying 
the stability condition, we set 7T; = 0 (i > 2). Therefore we can simul¬ 
taneously solve (18) and (19) to obtain no and tt\. The important point 
is that 7i"o, the coefficient on the current innovation, will depend on 
consequently the variance of output depends on the feedback rules. 

The basis of this result is that the agents in the goods market with 
superior information demand goods this period in reaction to expected 
future prices because these affect the real interest rate they expect to 
pay. Even though expected future output is invariant to the feedback 
rule, expected future prices are not in these models — clearly not, since 
the demand for output is affected by feedback and this in turn has to be 
equated with given output supply by prices and interest rates. So current 
demand for goods is affected by the feedback parameters via their effect 
on expected future prices, and the response of goods demand, and so of 
prices and so of output, to shocks is correspondingly modified. The gov¬ 
ernment can thus exploit these agents’ information without itself having 
access to it. This second asymmetry result is, however, subject to ques¬ 
tioning of a similar type to the first: namely, the basis for the restriction 
of such macroeconomic information to one set of agents. The case for 
macroeconomic information on individual markets being so restricted 
seems more secure, although this is communicable through asset prices. 
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But macroeconomic information, once available, is a public good which, 
first, it is usual for the government to insist be made available at low 
cost; second, even if it is not so provided, it would pay the possessors 
to divulge it for a fee to other agents, since this maximizes the overall 
possibilities for its exploitation; third, asset prices themselves will com¬ 
municate this information indirectly to other agents. The model just 
used furthermore makes the strong assumption that people operating in 
asset markets know all current macro data (this is implicit in taking 
expectations based on current period data), which is clearly implausible. 

In short, the overall set-up here is generally implausible in both the 
asymmetry and the comprehensiveness of the group’s information set. 


PARTIAL INFORMATION 

The result above can be refined to deal with the two objections under 
certain conditions. Suppose we let everyone in the economy have access 
to some partial current information, as discussed in chapter 3. When 
that information is micro, it turns out that flexible rules will affect the 
variance of output. The reason is the same as in the Turnovsky-Weiss 
case: people react to current shocks because they have incomplete infor¬ 
mation but the flexible policy rule affects expected future prices, which 
in turn condition those reactions. Of course, if people could disentangle 
from their current information exactly what the current money supply 
shock was, then they could protect their real wages, relative prices and 
real supplies and demands against mere monetary noise, and a flexible 
money supply rule would be ineffective; but they cannot, and so it is ef¬ 
fective. As for a flexible fiscal rule, that too is effective provided people 
cannot disentangle the shocks well enough to predict the current price 
level exactly: in other words, they have less than full current informa¬ 
tion, which is guaranteed by assumption. 

To illustrate policy effectiveness in the presence of micro partial in¬ 
formation, take the model of chapter 3, equations (15) and (16), in which 
people know their local prices only (models of this sort with policy effec¬ 
tiveness were first set out in Marini (1985, 1986) and Minford, 1986). Let 
us modify the model equation (15) by the addition of a flexible money 
supply response to output, — y{yt~i — y*)and a Cagan-style demand re¬ 
sponse to expected inflation. The model now becomes: 


m +e t — /j,(y t -iy*) =Pt+Vt~ a(E t p t+ i - E t p t ) (22) 


Vt-y * = 


6 


(Pt - E\p t I $t-i]) 


(23) 



Stabilization Policy — Can We? 


115 


where as before <f> = ■ Using the Muth solution method and our 

previous results that E t e t = (pet, we can substitute for y t from (23) into 
( 22 ) to obtain: 

_ (1 -(p) 

m + £ t — y -^-7r 0 e t -i 

^ (W) 

= > , TTiet -1 + y* H- 7 -7T0Ct - cr(7Ti0et - 7T 0 <pe t ) 

t =o 

OO 

+ ^ (7T i+ i - TTi)et-i (24) 

i= 1 

(terms in e t ) 1 = ^ 07(1 - 0 ) 7 r o - atnicp - 7 r 0 (/>) (25) 

0 

(terms in e t _i) - ^(1 - <p)ir 0 = 7 Ti - a( 7 r 2 - 7 Ti) (26) 


(terms in e t -i, i > 2) 0 = (1 + a) 7 Ti_i — a 7 T; + i (27) 

Imposing the terminal condition ttn = 0 (N > 2) yields 7 iy = 0 
(i > 2 ); 7 r 1 = — /j,(l — <P)tto/8', and 


7T0 


1 


1 + 1 + OMp + 


fi( 1+a) 


(28) 


It is clear from (28) that the parameter /i of the flexible rule affects 
7 To and so output’s response to the monetary shock, which is 11 ~^ 7r ° . It 
turns out, when 0 is substituted out in terms of 7 Tq, that (28) is a quintic 
in 7 To- Computer solutions for a wide variety of possible parameter values 
indicate that 7 Tq has only one real root, which is reduced as /j rises: this 
yields the commonsense result that the more policy ‘leans against’ the 
recent business cycle, the more it stabilizes output. Suppose a monetary 
expansion, e t , raises output through a surprise rise in prices, this causes 
an expected money supply contraction through the flexible rule, implying 
expected future price deflation, which in turn raises the current demand 
for money, lowers that for goods and so partially counteracts the upward 
pressure on current prices exerted by the current monetary expansion. 

The process is illustrated in figure 4.1. A'D' shows the aggregate 
demand curve shift from et alone. But next period’s AD curve, A\D\, 
shifts leftwards generating an expected price fall to E t pt+ 1 - This also 
shifts current aggregate demand to A 0 D 0 . The path of prices and output 
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Figure 4.1: A monetary shock, et, with a monetary feedback rule and local 
price information 


is marked by the arrows. The point of the illustration is to show that yt 
rises to less than it would reach without the rule. 

When people have only partial macro information as in our second 
model in chapter 3, equations (17)—(20), this same effect does not in fact 
occur. Consider the effect of p the flexible response of money to past 
events, in that model. While the solution for prices and interest rates 
reflects the response parameter, p, that for output does not. The reason 
is that the current information on interest rates allows people to work out 
exactly what the effect of the feedback response is on expected future 
prices; since it can be worked out exactly, it is impounded into E t p t 
and cannot affect output, which depends on producers being surprised 
by prices. In effect, in figure 3.4 whatever change in E t pt is produced 
by feedback (affecting E t p t+ 1 ), it shifts both the AS and AD curves 
vertically by the same amount, leaving y t and E t y t the same. 

However, when people have both micro information on local prices 
and macro information on interest rates, effectiveness is restored (see, 
for example, Barro, 1980; King, 1982). Let us take as a representative 
model this same model, equations (17)-(20) of chapter 3, and add local 
price information, so that pa = Pt + v lt : this is essentially the model of 
Barro (1980). There are now two pieces of information, pu and Rt. For 
simplicity, let v t = 0 so that there are only two macro errors, e t and u t . 

Plainly both p lt and R t will be used to estimate e t and u t . In this 
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case we would have 


7 ?/Ut — O^uPit ' I 0 u Rt 


(29) 


Et e t — QtePit + ( 3 e Rt. 


(30) 


where the as and /3s would be derived from the regressions of Ut and et on 
pu and R t ;R t = Au t + De t + Zm t -i&nd p it = q 0 u t +7r 0 et+u it +7r TO m t _ 1 . 
It turns out from the regression formulae that: 


O'v/ — 


DI< 

A~ 


(31) 


a P 


AK 


q 0 I\ + Dal a 2 
A 


Pe = 


(32) 

(33) 


/? 




-TTp/v + Augo-g 
A 


(34) 


where A = ( q 0 D - Ano^alal + al(A 2 a'l + D 2 a 2 e ) and K = (q 0 D - 
An 0 )a 2 u a 2 e . 

It follows that 


Aq, u -\- Da e — 0 

(35) 

and 


Df3 e + Af3 u = 1 

(36) 

Equation (24) holds as before so that here 


ae t + Ut — o-EtPt + Et u t 

(37) 


Substituting from E t et and E t Ut. gives: 

ae t + u t = [(aa e + a u )q 0 + (q(3 e + f3 u )A] u t + [(aa e + a u )7r 0 

+ ( ot(3 e + (3 u )D]e t (38) 

yielding two identities in u t , e* as 

1 = qoaa e + ot u qo + u/3 e + A[3 U (39) 


i _ . U u TTQ . r\ a , n D 

1 — QL e 7t 0 H-h Dp + f3 u — 

a 


a 


(40) 
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Letting, from (35), a u = —and, from (36), (3 U = and 

substituting these into (39) and (40) implies that ^ = a. 

Consequently the regression coefficients simplify to 

_ aK _ K a _ q 0 K + aa 2 v a 2 e Q _ -7 t 0 K + a 2 v <j 2 u 

u u - A , a e - A ; Pe ~ AA \Pu~ AA 

where K = ( aq 0 - tt 0 )<t 2 <7 2 and A = ( aq 0 - 7r 0 ) 2 cr 2 cr 2 + ct 2 (ct 2 + a 2 a 2 e ). 
The solution is then worked out for R t as before in chapter 3 yielding 

(1 - a)(l - aVE)(l + c[l - ^]) + a( 1 + c)(l - n)W 
A ~ X ; 

„ _ -(1 - a)a(l + c[l - fj])V + a(l + c)(l - /u)(1 - V ) 

“ X 

where W = aa ^‘ ; V = and X = a(l + c)(l + c[l — fj,]). 

Finally, using the expression for p t = noe t + qoU t +m t - 1 , 7To, qo and 
7r m are found by the undetermined coefficients method. 

We then find that 


yt = (1 - aW)u t + aVe t (41) 

It turns out therefore that ^ indeed enters the determination of yt = 
E t u t because it affects the weight of R t and pu in forming expectations 
of ut and so also 7To, qo, A, D. 


AUTOMATIC STABILIZERS: AN ASIDE 

Before going on to consider alternative assumptions about the supply 
curve we digress briefly to discuss the potential goal of ‘automatic’ sta¬ 
bilizers. By an automatic stabilizer we mean a mechanism in which a 
variable (for instance, tax liabilities) responds to current income lev¬ 
els, and therefore provides an automatic and immediate adjustment to 
current disturbances. 

This is to be distinguished from policy actions in response to global 
information, such as we have been considering hitherto; ‘automatic’ im¬ 
plies that the response is effected at the microeconomic level, without 
recourse to macroeconomic information or to higher political authority. 
Tax liabilities, when tax rates are set, are of this sort: only the taxpayer, 
his income and the tax man are involved. In the monetary area, certain 
open market procedures — such as pegging central bank liabilities by 
Treasury bill sales — also fall into this category. The work of Poole 
(1970) on monetary policy in a closed economy and of Parkin (1978) on 
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monetary and exchange rate intervention in an open economy can be 
regarded as dealing with these types of stabilizer. 

McCallum and Whittaker (1979) considered the properties of auto¬ 
matic tax stabilizers and showed that they do influence the variance of 
output. Their point can most easily be demonstrated by writing the 
aggregate demand schedule as: 

Vt = a'(Rt - Et-ipt+i + p t ) - cT t yt (42) 


where t is the direct tax elasticity and a is the elasticity of spending to 
temporary variations in tax liabilities. The tax elasticity, 

_ 31n(tax receipts) _ () 

3 In (output) “ ( 5 ^) 

is the marginal tax rate divided by the average tax rate. 

If we define 


a' 

1 + at 


(43) 


then the solution of the model (42), (3), (4) and (5) is the same as 
(9) and (10), but where a is defined as here in (43). Consequently 
the solution for output is not independent of the automatic stabilizer, 
given this orthodox aggregate demand function ; 5 a higher tax elasticity 
reduces the variance of output. However, although a high tax elasticity 
contributes to a reduction in output fluctuations, it does so at the cost 
of distortions to the operations of markets at the micro level: the highest 
tax elasticity of all is obtained when the marginal rate is 100 per cent! 

The role of automatic stabilizers of this sort is quite distinct from 
that of feedback policy, although sometimes they are confused in popular 
discussion. It is, as we have seen, preserved within the Sargent and 
Wallace model considered above. However, there is one interesting set 
of conditions under which a particular monetary stabilizer is ineffective. 
This is where people have access to the same partial macro (or micro) 
information responded to by the monetary authorities. 

Consider an ‘automatic response’ to the current interest rate, as dis¬ 
cussed by Poole (1970), in the context of the macro model with partial 
macro information in chapter 3, equations (17)-(20). Suppose we rewrite 
the money supply function (19) as 


m t = pmt -i + r/Rt + e t 


(3.19') 


Assume first that the monetary authorities can respond at a micro 
level (e.g. in the treasury bill market) to a market interest rate, with the 
effect aggregated over the whole security market of r)R t ; assume also 
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that no one observes the aggregate interest rate, R t . Then the effect is to 
augment c, whenever it occurs in the solution, to d = c+rj (flattening the 
LM curve). This will, as Poole suggested, dampen the effect of money 
and supply shocks, u t and e t , and augment that of demand shocks, v t , 
as can be verified from the first line of table 3.1. 

Now suppose R t to be known to all as partial macro information; 
then the same policy (now no longer a response to micro data, but one 
to macroeconomic information) has no effect on output at all, as can be 
seen from the second line of table 3.1 where c does not enter. 

We therefore have the result that interest rate stabilization is ren¬ 
dered ineffective (on output) in a Sargent -Wallace framework when the 
interest rate is universally observed. The reason is that any such re¬ 
sponse in impounded into E t p t (because people can work out the money 
change due to rjR . t ) and cannot affect the surprise element p t — E t pt- 

This would not be true of any variable to which the monetary au¬ 
thorities could respond at a micro level and which was not universally 
observed, as in the case above with R t when unobserved. In this case 
people could not work out the money change to this response, and it 
could affect the surprise element, p t — E t P t . However, plausible candi¬ 
dates for such a variable are hard to think of. 

Nevertheless, the interesting possibility is introduced by macro infor¬ 
mation that the authorities can reduce the variance of output by raising 
the variance of the money supply shock, et, i.e. by deliberately mak¬ 
ing larger rather than smaller mistakes. Previously this was impossible; 
higher vare necessarily implied higher vary since e* entered tjt additively. 
But now vare affects the coefficients of the y expression via <j > u , (f> v , (j) e . 

Consider the asymptotic variance of y, a 2 . Substituting for <f > u , (p v , 
<f> e from (32) in the y expression (table 3.1, line 3) we obtain: 


2 2 2 
a <rj - aaj 

X' 


+ (! + a )vl l2 r _2 , „2_2i 

[--^7-J K + a °eJ 


(44) 


Now we find that as a 2 —> oo and <j) e —> 1, a 2 —> a 2 (the variance 
of the demand shock). In this case, the variance of output is dominated 
by the variance of demand shocks because suppliers become totally un¬ 
responsive to prices, believing them to reflect solely ‘noise’ in the money 
supply. It is clear that this may reduce the variance of output compared 
to the no-monetary-noise model; thus as a 2 —» 0, 

2 a 2 ^Ml) 2 + Wl + (l+a)^l 

W + (l + a)VJ2 1 j 


which, depending on a 2 and a, can exceed a 2 . 

Yet it can be shown that a 2 is an inappropriate indicator of welfare 
and that the optimal policy is, commonsensically, to minimize a 2 (that 
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is, for the central bank cashiers to make as few and as small mistakes as 
possible). 

Abstracting from the usual problems (public goods, externalities, in¬ 
complete markets, etc.) the Pareto-optimal situation under uncertainty 
is one of Walrasian equilibrium when all the shocks are known to all 
agents (this is discussed at greater length in chapter 5). In the context 
of our model output would in this situation be simply y t = u t , because 
E t pt. = Ptand u t , the supply shock, would shift our vertical supply curve 
fully along the quantity axis. 

The optimal outcome under uncertainty, under normal assumptions 
for social welfare, is one which minimizes the variance of output from 
this outcome, as well as ensuring that this is the expected outcome: 
that is, such that E t y t = E t u t and a 2 u = E(y — u) 2 is a minimum. All 
rational expectations outcomes, whatever the information set, guarantee 
that E t y t = E t Uf The problem therefore reduces to choosing a 2 to 
minimise a 2 u . However a 2 u = a 2 E(p t — E t pt.) 2 , so that the optimal 
policy is equivalently to minimise the variance of unanticipated price 
changes a 2 e . 

Using our earlier expression for y t (table 3.1, line 3), we find that 


(Pt - E t Pt) = 


Vt - u t 
a 


-[cf) e + a( 1 — <f> u )]u t + ((1 + a)cp u + (p v )ae t 

+[(1 + a)<p e - a(l + a)(t> v )\vt _ 

a(l + a) 


Hence 


[< + fl2 (l — 0 U ) 2 + 2a^ e (l — (p u )]&u + [(1 + a) 2 '/’ 2 + (pl 
+2(1 + a)<p u <p v ]a 2 a 2 + [(1 + a) 2 ^ + a 2 (l + a) 2 0 2 
2 —2a(l + a) 2 (p e (p u }(j 2 , 

Tpe ~ [a(l + a)] 2 

As (T^ —> 0, we find that 


a: 


2 2 
a v°u 


pe a 2 u + {\ +a) 2 a 2 


and that as a 2 —»■ oo 


0-2 + vl 


pe 


The ratio of (36) to (35), K, is given by 

R ( al+a 2 u )l* 2 u + (l + a) 2 a 2 v ] > 1 

a 2 a 2 v a 2 u 


(46) 


(47) 


(48) 


(49) 


(50) 
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so that the low extreme dominates the high extreme. 

We can also show that minimizing a 2 minimizes a 2 e ; for > 0 
throughout the range of a 2 . 

Differentiating (47) yields: 


+ ° 2 + 2 a{(p e + (f) v )\ + 4 a 2 (p 2 u (j) v + (j) 2 v (p e 
^ a pe _ +4 a4' e ( t > u ( Pv + (<Au + <A)[(1 + + <t > v ] 2 

<5cr 2 (1 + a) 2 


Hence welfare is unambiguously maximised by minimising the vari¬ 
ance of the money supply, as we would instinctively expect to be the 
case. 


MODELS WITH LONG-TERM NON-CONTINGENT 
CONTRACTS: THE NEW KEYNESIAN PHILLIPS CU¬ 
RVE 

One of the assumptions required for anticipated monetary policy to have 
no effect on output in the Sargent-Wallace model is that agents are 
able to act on their information sets. If we have a situation where, for 
instance, private agents cannot respond to new information by changing 
their consumption, wage-price decisions, etc., as quickly as the public 
sector can change any (at least one) of its controls, then scope once 
again emerges for systematic stabilization policy to have real effects. 
This insight was developed principally by Fischer (1977a, b) and Phelps 
and Taylor (1977) in the context of multi-period non-contingent wage or 
price contracts. 

Suppose we have a situation where all wage contracts run for two 
periods and the contract drawn in period t specifies nominal wages for 
periods t + 1 and t + 2. At each period of time, half the labour force is 
covered by a pre-existing contract. As long as the contracts are not con¬ 
tingent on new information that accrues during the contract period, this 
creates the possibility of stabilization policy. Firms respond to changes 
in their environment (say, unpredictable changes in demand which were 
unanticipated at the time of the pre-existing contract) by altering out¬ 
put and employment at the pre-contracted wage; only contracts which 
are up for renewal can reflect prevailing information. If the monetary 
authorities can respond to new information that has accrued between 
the time the two-period contract is drawn up and the last period of the 
operation of the contract, then systematic stabilization policy is possi¬ 
ble. In other words, while there are no information differences between 
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public and private agents, the speed of response to the new information 
is different. 

The essentials of this argument involve replacing the Sargent-Wallace 
supply equation (3) with one based on overlapping contracts. Suppose, 
following Fischer (1977 a, b), that wages are set for two periods so as to 
maintain expected real wages constant at a ‘normal’ level. Denote (the 
log of nominal) wages set in period t — 1 for period t as t-iWt- Then 

t-iWt = E t -iPt (52) 

(where the log of normal real wages is set to zero) and current nominal 
wages are 

W t = 0.5(t~2Wt +t—i W t ) = 0.5(E t -2Pt + Et-iPt) (53) 

Now let output supply be a declining function of the real wage (from 
firms maximizing profits subject to a production function with labour 
input and some fixed overheads): 

y t = -q(W t -p t ) + y* (54) 

We derive from these the new supply equation: 

Vt = 0.5 q[(p t - E t _ 2 pt) + ( p t ~ E t _\p t )} + y* (55) 

This equation, the New Keynesian (NK) Phillips curve, and its deri¬ 
vation are illustrated in figure 4.2 (for Y = expy), which also contrasts 
it with the New Classical (NC) Phillips curve, in a diagram also taken 
from Parkin and Bade (1988). In figure 4.2, as p rises from its expected 
level po to pi, MPL+p (the wage offer for labour where MPL is the log 
of labour’s marginal product) shifts: under NC nominal wages are bid 
up along SS, under NK wages are fixed at the contract level Wo- Hence 
the NK Phillips curve, Snk, is flatter than the NC Philips curve, Snc- 

Let us use (55) in place of (43), together with the rest of the model 
(2), (3) and (5); then it can conveniently be written in terms of the Muth 
solution as: 


y t = q( n 0 w t + 0.57riiu t _i) + y* (56) 

The model solution equation can now be written: 

Q 

m + qp,(TT 0 w t -i + 0.57ritx; t _2) + w t =p t + q{ 1 + -)(n 0 w t + 0.57Titc t _i) 

- c(K t _ip t +i - Et-ipt) + (1 + —)y* (57) 

a 

The identities in the w t-i are now: 

1 = 7T 0 [1 + q(l + -)] 
a 


(terms in Wt) 


(58) 
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Figure 4.2: The New Keynesian Phillips curve (contrasted with the New Clas¬ 
sical) 


(terms in iu t - 1 ) 

Q 

= 7Ti + 0.5g(l H- )7T! - c( 7T 2 - 7Ti) 

a 

(59) 

(terms in w t ~ 2 ) 

0.5qpni = 7T2 — c(7t 3 — 7t 2 ) 

(60) 

(terms in w t -i, i > 3) 

0 = (1 + c)7Ti - C7T i+ i 

(61) 


Equation (61) gives 7 n = 0 (i > 3) applying the stability condition, 
whence we can solve the other three equations for 7T2, 7 Ti, 7to. /j enters the 
solution for 7Ti and n 2 and, since 7Ti enters the output supply equation, 
H therefore influences the variance of output. In fact in this particular 
example, it will raise the variance; minimum variance occurs where fi= 
0, since this sets 7ri=7T2=7ro. 

The model is illustrated in figure 4.3. With feedback, p ^ 0, we 
obtain the path shown. Suppose there is a temporary aggregate demand 
shock in t = 0, shifting AD to A 0 D 0 . The supply curve, S*S*, whose 
position is fixed by E t _ 2 Pt = E t _\p t = p*, does not move; we reach 
point 0. Next period, aggregate demand is shifted by negative feedback 
on yo to A\D\. Half the workers have now renegotiated wages in t = 0 
with EqPi = pi (their expectations of pi in period 0), So the supply 
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curve shifts to S\ Sj . Next period aggregate demand shifts to A 2 D 2 as 
feedback now raises it in response to y\. All workers have renegotiated 
wages, fully expecting p 2 = E 0 p 2 (there is no further surprise in prices 
relative to wage negotiations); so j /2 = y*. In period 3, finally, feedback 
stops so both aggregate demand and supply return to normal, that is 
A*D* and S*S*. 

Of course, the diagram shows clearly that had the aggregate demand 
curve not reacted with feedback, then in period 1 it would have returned 
to A*D* and workers would not have needed to renegotiate wages, stay¬ 
ing on S*S*. Thus the path would have been direct from point 0 to 
point 3, clearly more stable than the path with feedback. 



The path w ith feedback is show n by numbers; the path w ithout 
feedback is 0, 3, 3... 


Figure 4.3: The effectiveness of feedback response in the New Keynesian model 

This example illustrates the obvious point that the case for stabiliza¬ 
tion policy does not rest with showing effectiveness; it is also necessary 
to show optimality. Nevertheless, it is easy to construct examples where 
p 7 ^ 0 minimizes output variance; the reader should investigate one such 
as an exercise, namely when an adjustment term +j(yt -1 — y*) is added 
to (55). The reader should find that, while 7 r Q is unaltered, the expression 
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for 7ri becomes: 


+ ,,K - !| + e _ 

1 + c(l — j) a 1 + c 


Since j(no — 1) <0, the optimal value of y found by minimising the 
variance of y — y*, sets Eo(yt+i —y*) = 0 after a shock in t = 0, implying 
that noj + 7Ti = 0. It turns out that the optimal value of y = ~ J o 5 q C) 
is negative, representing the normal ‘leaning into the wind’ response. 

Models with overlapping contracting have been developed by Taylor 
(1979a, b, 1980) in a series of papers in order to show that important 
features of the business cycle can be captured by integrating this type 
of supply curve into standard macroeconomic analysis, and to analyse 
optimal policy design in such an economy. 

Three points of weakness remain in this approach. First, the theoret¬ 
ical basis of non-contingent contracts, in which the nominal wage or price 
is fixed and quantity is set by demand, has not been established to uni¬ 
versal satisfaction. One approach is to assume that in incomplete mar¬ 
kets nominal wage contracts offer insurance against real shocks. Another 
approach is to assume ‘menu costs’, that is transactions costs in fixing, 
negotiating and changing prices which are reduced by periodic contracts: 
this approach (assumed by earlier authors such as Barro (1972), Mussa 
(1981) and Canzoneri (1980)) has been explored theoretically by, for ex¬ 
ample, Rotemberg (1983) and Parkin (1986). What is harder to establish 
is how large these menu costs are and why, given that non-contingent 
contracts risk losses in the face of shocks, the contracts do not build in 
contingency clauses (which may be less expensive to write in than the 
potential losses they avoid). The humdrum answer may be that people 
are approximately risk neutral for small risks and so reduce menu costs 
by writing non-contingent contracts that expose them totally to these 
risks: they only bother with contingency clauses (or other insurance) for 
the large risks. This tallies with other insurance practices, such as excess 
clauses and no claims bonuses, which effectively exclude the small risks. 

If this is so, then the appearance of non-contingent contracts may 
be deceptive, the second weakness in this approach. They may be truly 
non-contingent for only rather trivial shocks. Indeed, closer inspection 
reveals that actual contracts are exceedingly complex once implicit ele¬ 
ments are taken into account. For example, they will typically include 
bonus, discount and lay-off elements for quantity variation, and index¬ 
ation (whether formal or informal via shop-floor renegotiation) is fre¬ 
quently found. 

The third and related weakness is that, if the authorities were sys¬ 
tematically to exploit these contracts in a way not envisaged at the time 
the contracts were set, then this would presumably lead to differences in 
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the way contracts were written (contract length and indexation clauses 
are clearly endogenous). In the limit, if the government systematically 
exploited them in a way that altered agents’ outcomes excessively from 
what they wished, then long-run contracts would be written in such a 
way that they were equivalent to a succession of single-period contracts 
so that the scope for stabilization policy would disappear. 

For all these reasons (we revert to these issues in chapter 6) there 
remains considerable doubt as to whether non-contingent contracts can 
be regarded as a firm basis for modelling and policy formulation. Nev¬ 
ertheless, in practice they are widely used in macroeconomic modelling, 
since they both appear to be widely used and do pick up usefully the 
short-term nominal rigidity observed in wages: clearly this implies that 
we must treat analysis based on such models with caution — but then 
what is new about that? 

One final point: the ‘New Keynesian’ and ‘New Classical’ supply 
curves are often presented as a contrast between ‘disequilibrium’ and 
‘market clearing’ approaches. This can be misleading. The fact that 
people may sign non-contingent contracts does not imply either that 
they are in disequilibrium or that markets do not clear when shocks 
occur later on during the contract period. Obviously they were aware 
this could happen (hence no ‘disequilibrium’) and planned not to vary 
their price in response to changed demand (hence their supply is elastic, 
‘clearing’ the market). We are dealing with a different (non-auction) 
market structure entered into voluntarily by rational agents: this implies 
different properties in response to shocks, that is all. It is quite different 
from the Keynesian or old Phillips curve assumptions set out in chapter 
1 . 


NEW CLASSICAL MODELS WITH INTERTEMPO¬ 
RAL SUBSTITUTION 

The last group of models we wish to examine for feedback effectiveness 
is New Classical models with intertemporal substitution fully operative. 
The earlier New Classical models of this chapter suppressed one mecha¬ 
nism, the role of real interest rates in varying labour supply; empirically, 
this mechanism itself is of doubtful significance but in the open economy 
movements in real interest rates are associated with movements in the 
real exchange rate, and these are found to have powerful effects on labour 
supply, as is discussed in chapter 8. We can think of this closed economy 
mechanism as a proxy for that powerful open economy mechanism. It 
has some interesting theoretical implications for policy effectiveness. 
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As explained in chapter 3, the New Classical supply function is de¬ 
rived from workers or consumers maximizing expected utility subject to 
a life-time budget constraint (a nicely tractable set-up, which has been 
explored by Sargent (e.g. 1979a, chapter 16), is the quadratic utility 
function with quadratic adjustment costs). From such a framework one 
can obtain a formal supply of labour equation of the form: 

n t = /K, wt+i -w t - r t , nt- 1 ) (63) 

where w is the real wage, n is labour supply (both in logarithms) and r is 
the real interest rate, which we now treat as a variable. The e superscript 
denotes expected at time t. The information set assumed in this is last 
period’s macroeconomic information and each worker also observes at 
the micro level his or her current nominal wages; but we assume that no 
micro information is usable for signal extraction about macro data. So 
wf = Wt — E t -ipt = Wt + Pt — Et-iPt where W t is nominal wages (in 
logs). The first term in (4.63) represents the long-term effect of rising 
wages on supply, while the second represents intertemporal substitution 
with a single-period ‘future’ for simplicity; the third represents costs of 
adjustment. «?® +1 is standing in for the whole future path of real wages 
and it will be helpful for our purpose here to treat it as a constant, ‘the 
future normal real wage’. 

Let us write the equation in (log) linear form as: 

nt = cr 0 +<Ji(w t +pt~ Et-ipt) + a 2 r t + jn t -i(0 < j < 1) (64) 

Now juxtapose this with a demand for labour function (65) derived 
from a simple Cobb-Douglas production function, y t = 6k t + (1 — 6)n t , 
with a fixed overhead element k t 

nt = yt — w t (65) 

from (65) and the production function we have 

nt = k t - \w t (66) 

o 

or 

w t = -Snt + Skt (67) 

Substituting for w t from this into (64) gives 

1 akt+(J 0 +a 2 r t + (Ji{pt-Et-tPt) > 

n t = t--1- j - (68) 

1 H - cl 1 — qL 

where q = a = 6a\. 
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Using (66) gives us: 

2 _ ^ 

Dt = (1 - qL)6k t + + °o + <r 2 r t + <ri(p t - E t -ip t )] + qy t - i 

(69) 

The steady state values of r t and y t (r*, y*) will depend on the whole 
model while k t we assume to be held constant here; for simplicity we will 
normalize them all at zero in what follows. 

Now write the full model as: 


y t = -ar t + y f (y t - i) 

(70) 

= dr t + (3(p t - Et-ipt) + qy t - i 

(71) 

mt=Pt + yt- cRt + V t 

(72) 

m t = m + y m yt-i + u t 

(73) 

R t = r t + Et-ipt+i — Et-ipt. 

(74) 


Equation (70) is the IS curve with the fiscal feedback parameter y^. 

(71) is the supply curve with d, (3, q taken from (69) (e.g. d = y-j^.<7 2 ). 

(72) is money demand, (73) is money supply with feedback parameter, 
y m . (74) is the Fisher identity. 

We can immediately establish by (70) and (71) that fiscal feedback 
is effective, but monetary feedback is not. We obtain 


1(1 -l* f L) 


C Pt - Et-iPt) 


(75) 


This expression for r t then can be substituted into (70) to obtain y t : 
clearly the reaction of ijt to unanticipated prices depends importantly on 
yj but not on y m . As for p t — E t ~iPt, this is quickly found as: 


p t - Et-ipt 


a + d 

a T d T (3{cx T c) 


(' u t - v t ) 


(76) 


The intuition behind this result is that fiscal policy is causing in¬ 
tertemporal substitution of supply, in order to offset the ‘cyclical’ effects 
of shocks. Incidentally, this effect of fiscal feedback is quite independent 
of whether government bonds are net wealth (discussed in chapter 7). 
For example, even if private consumption depends only on permanent 
income and not on transitory income, the government expenditure pat¬ 
tern over time could be altered without affecting the present value of 
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the tax stream, so altering the pattern of total demand over time. Of 
course, if private consumption depends also on transitory income, then 
alteration of the temporal pattern of taxes, holding the present value of 
the tax stream constant, would also have this effect. Such alteration of 
the patterns of aggregate demand over time then sets off the movement 
in real interest rates which creates intertemporal substitution in supply. 

These points are illustrated in figure 4.4, where it is assumed that 
q « 0.5 and pf « —1; the diagram is in (r, y) space instead of the more 
usual (p, y) space, to focus on real interest rate movements. Initially, we 
assume some money supply shock drives prices up, unexpectedly shifting 
the SS New Classical Supply curve rightwards; real interest rates drop to 
point 0 along the original IS curve. Now, if there were no fiscal feedback, 
Pf =0, the SS curve would shift back to S*S* at the rate of 50 per cent 
of (yt-i ~y*) per period. The path would be traced by the arrows along 
the I*S* curve. With fiscal feedback, the IS shifts leftwards to I±Si 
reaching point 1, where output is at y*; hence in period 2 we return to 
to I*S* (plainly faster in this example than that with no feedback). The 
path of output is seen to be fully determined by this diagram; monetary 
feedback policy enters neither curve, so is ineffective. Only the shock to 
the money supply enters through p t — E t -\Pt- 

The ineffectiveness of monetary feedback policy is negated by the 
introduction of wealth effects into the IS curve (or the supply curve). 
Assume that consumers hold long maturity bonds with fixed coupons 
denominated in money terms: these must be government bonds in this 
closed economy, since any private sector bonds would net out in a consol¬ 
idated private sector balance sheet. If we treat these government bonds 
as net wealth (chapter 7), then variations in the price level brought about 
by monetary policy, future government expenditure constant, will alter 
the real value of these and so net wealth and spending. 

The point can be seen by adding the term f(b—p t ) into (70) 1 . We set 
Pf = 0 since fiscal policy will remain effective as before; and to simplify 
the algebra here we set q = 0 = b. Using (70) and (71) we now obtain: 

r t = —f'Pt ~ 0{Pt ~ E t -iPt)(77) (77) 

where f = 0 = 0^. Equating (72) and (73) and substituting 

into the result (77) for r t and (71) for y t we obtain the following equation 


1 6, the example here, would be the log of a perpetuity bond issue at =£100 face 
value which promises to pay £100xRt (Rt being the perpetuity rate of interest) for 
all future periods; its present value will always be .£100. The point made here goes 
through for all types of nominal bonds; only if bonds are indexed will it not do so, 
for the obvious reason that the path of prices becomes irrelevant to the value of net 
wealth. 
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Monetary feedback is ineffective because only 
inflation surprise enters the SS curve. 


Figure 4.4: The effectiveness of fiscal policy with intertemporal substitution 
of supply 


in pt- 


(1 - f'd + f'c)p t + f n m dp t -i - cE t ~ipt + i + cE t ^ipt + 

((3 - 0d + 0c)(p t - E t -ipt) - p m (/3 - 0d)(p t -i - E t -ip t ~i) 

= rn + u t - v t (78) 

oo 

If Pt = 53 7 TiWt-i +p where w t = u t — i’t then the identities in the 
i=0 

w t -i are: 

(w t ) [(1 - f'd + f'c) + {(3 - 0d + /3'c)]tt 0 = 1 (79) 

(w t -l) (1 - f'd + /'c)7Ti + f'p, m TT 0 - C7T2 + C7Tl - /fJ?i(/3 - 0d)TT 0 = 0 

(80) 

I > 2) 7r f+1 - + 1 + f')i U - —————71^—1 = 0 

c c 

(81) 

Suppose (81) has a unique stable root <5: then tt 2 =6tti where 6 de¬ 
pends on pL m . 7Ti also depends on p m from (80). Now output is given 
by (71) using (77), as: 

Vt = —df'pt + a0(p t - Et-ipt) 


( 82 ) 
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from which it is apparent that p, m enters the solution for output too. 
Wealth effects make monetary feedback policy effective. 

The full classical model of labour supply therefore yields two interest¬ 
ing propositions. First, without wealth effects fiscal feedback is effective 
but monetary feedback is not (this is noted by Sargent, 1979a, chapter 
16). Secondly, with wealth effects both are effective. Again, this by no 
means establishes that feedback rules are beneficial. Sargent (1979a), 
for example, shows that if welfare is measured by the sum of identi¬ 
cal consumers’ expected utility, then with no wealth effects zero fiscal 
feedback is optimal in the case of quadratic utility and production func¬ 
tions. That issue we defer. As for the existence of wealth effects, on 
which the effectiveness of monetary policy turns, that too is an issue 
requiring separate discussion; theoretically and empirically it is at this 
point an open question (chapter 7). Nevertheless, as a minimum it is 
of some interest that, even without signal extraction from local prices, 
new classical models in general give scope for fiscal feedback and across 
a potentially broad class also give scope for monetary feedback. This 
is contrary to the impression given (no doubt unintentionally) by much 
of the early literature, although subsequently corrected by Lucas and 
Sargent (1978). 


CONCLUSIONS 

We have shown in the context of equilibrium linear models that there 
is one main set of assumptions under which neither monetary nor fiscal 
feedback policies have an impact on the variance of output: these are 
a New Classical (or old Keynesian with rational expectations) Phillips 
curve of the sort assumed by Sargent and Wallace, without signal ex¬ 
traction from local prices, without intertemporal substitution in supply 
induced by real interest rates, and without information asymmetries. It 
would be turgid and counter-productive to list here again all the con¬ 
ditions under which effectiveness of either fiscal or monetary feedback 
policy is or is not preserved. 

The general proposition in this chapter is that rational expectations 
as such do not rule out counter-cyclical policy, but rather they alter its 
impact 2 , leaving it as an empirical matter whether they do or do not re- 


2 This viewpoint is in principle reinforced by work (e.g. Dickinson et al., 1982) 
which has taken up a suggestion of Shiller (1978) and shown that if a non-linear 
version of the Sargent-Wallace supply function replaced their original linear version 
then even retaining all other assumptions, stabilization policy is feasible. Clearly 
non-linearity will be a typical feature of models of national economies; neverthe- 
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duce the variance of relevant macroeconomic variables, and as a further 
issue whether they do or do not improve welfare. We also considered 
automatic stabilizers briefly and showed that their distinct role was not 
nullified by rational expectations, except in the specific New Classical 
case where people have current access to the same information that trig¬ 
gers the stabilizing mechanism; in this case output will be invariant be¬ 
cause people will incorporate the response into their price expectations. 
Once it is appreciated that stabilization policy is in general not ruled out 
by rational expectations models, whether New Classical or not, the issue 
of whether the economy is subject to ‘disequilibrium’ (a misnomer for 
non-contingent contracts) ceases to be of special significance: it is just 
one of a number of questions that have to be confronted in the detailed 
specification of a rational expectations model. It is the rationality of 
expectations itself that carries the really powerful implications for the 
nature of the impact of stabilization policy. 


less, it seems doubtful that this source of stabilization leverage is of much practical 
importance. 



5 

Stabilization Policy — Should we? 


In Chapters 3 and 4 we discussed the potential effectiveness (for output) 
of stabilization policy by which we mean feedback policy rather than 
‘automatic stabilizers’ which will generally be effective for reasons we 
explained. We have shown that in general such policy can be effective. 
The question then arises of whether any such policy is optimal and, if so, 
how it should be designed. In practice we will focus on monetary policy, 
since fiscal policy is generally regarded as too slow in implementation and 
too uncertain in effect to be useful (besides the concern that varying 
spending programmes and tax rates may cause micro distortions). In 
this chapter we will discuss the issue of optimality: should we stabilize 
the economy? In the next we will discuss practical issues of design. 

We will discuss two issues that bear on optimality: the welfare cri¬ 
terion and the question of credibility (or ‘time inconsistency’). The first 
concerns whether it is not better to leave the economy alone, letting it 
react in accordance with the decisions of private agents; has the gov¬ 
ernment reason to believe it can improve on these? The second issue 
concerns the problems raised by the government’s natural desire to ma¬ 
nipulate the economy in order to please its electorate; this may lead 
to the abandonment of inflation pledges in order to stimulate economic 
activity. Thus governments may find it hard to pursue a consistent anti¬ 
inflation policy and this will undermine its credibility. 


THE WELFARE CRITERION 

The yardstick of welfare that is most widely used by economists is the 
Pareto criterion. Pareto optimality is the condition in which no one can 
be made better off without someone else being made worse off; it can be 
illustrated for a two-person world by an Edgeworth box diagram (figure 
5.1), in which the ‘contract curve’ joins the points of tangency of the 
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two sets of indifference curves, these points being all Pareto-optimal. If 
it is then assumed that distributional considerations are absent, either 
because they do not matter or because the government achieves the 
socially desired distribution at all times, there can be a unique Pareto 
optimum for the specified distribution, illustrated by the circled dot in 
figure 5.1. 


Person B 



Figure 5.1: An Edgeworth box diagram 

The proof that, for a given distribution function there exists a unique 
Pareto optimum has been established for an economy with well-behaved 
preferenes and technology in a Walrasian equilibrium when markets are 
competitive and complete and there are no distortions (discrepancies 
between private social costs due, for example, to externalities); the proof 
is due to Arrow, Debreu and Hahn (see Arrow and Hahn, 1971). In an 
economy of this type, the steady-state level of output at the Pareto 
optimum would be the y* in our models. 

Of course, in actual economies, with incomplete and some monopo¬ 
listic markets and with distortions, y*, the equilibrium level of output, 
will not be at a Pareto optimum. However, it is usual in macroeconomics 
to assume that these problems are the province of ‘micro’ policies, and 
that there should be no attempt by macroeconomic policy to push y 
systematically away from y *; indeed we have already seen that any such 
attempt in an adaptive expectations model would cause ever-accelerating 
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or ever-decelerating prices and in a rational expectations model would 
be completely frustrated. Instead y* is taken, from the viewpoint of 
macroeconomic policy, to be the optimum output level in a steady state 
(it is, strictly, the ‘second-best’ optimum — but the best that can be 
achieved with macroeconomic policy instruments). 

Supposing this to be so, the question arises of what is the optimum 
short-run output level. In Keynesian disequilibrium models with adap¬ 
tive (or other backward-looking) expectations, if output is less (or more) 
than y*, this is involuntary and suboptimal, the result of ‘market failure’; 
hence the variation of output around y* is a natural criterion for mini¬ 
mization. 1 Since output tends to y* in the long run but y* will typically 
be growing over time, we can measure this variation as the variance of 
output around its moving trend var (y — y*). This is what we focused on 
in the discussion of effectiveness in chapter 4. 

Sometimes in these models other measures have also been included 
in the minimand to represent the costs due to loss of consumer surplus 
not included in GDP — for example, the variance of inflation or interest 
rates as proxies for consumer and financial uncertainty. However, it is 
usually assumed in the context of stabilization policy that the variations 
in these costs across alternative policies are relatively small; this would 
also be our assumption. 

In equilibrium models with adaptive expectations (such as the one 
implicit in Milton Friedman’s, Friedman, 1968), output deviations from 
y* arise because of expectations errors which could have been avoided 
by efficient use of available information. Again var(y — y*) is a natural 
minimand because agents would wish they had made output decisions 
on the basis of good forecasts. 

In equilibrium models with rational expectations, however, y only de¬ 
parts from y* because of unavoidable expectations errors. Such models 
include the New Keynesian ones of chapter 4, because the nominal wage 
contracts in these models are voluntarily negotiated, in the full knowl¬ 
edge that when shocks occur the response to them will be constrained by 
the contract. (This leaves open the possibility that monetary policy can 
improve the private sector’s feasible outcomes from its contract structure 
— we assume not here but return to the issue in chapter 6.) 

Output is always at its ‘desired’ level in these models in the sense 
that, given available information, agents are maximizing their welfare 
subject to their private constraints, including the effects of shocks and 
associated expectations errors. Then, provided the level of distortion in 


Hf output, however, changes for supply reasons (e.g. crop failure), this would 
naturally be added into y* and the variation computed around this adjusted figure. 
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the economy does not vary with (y—y*), government cannot improve and 
may reduce welfare by reducing var(y — y*) for the simple reason that 
it was already being maximized; an example of this is given by Sargent 
(1979, chapter 16) in a classical labour supply model where fiscal policy 
is effective because of intertemporal substitution, as discussed in chapter 
4, and the same point has been stressed by Beenstock (1980). 2 

The proviso that the level of distortion does not vary with (y — y*)will 
be violated in practice when unemployment benefits do not vary with 
wages; for when depressive shocks reduce output and employment, the 
gap between the private and social costs of unemployment will increase. 

A simple way of evaluating the social gain to stabilization policy is 
shown in figure 5.2. Assume that the undistorted supply of labour would 
be at L + ; it is entirely inelastic. Assume also that if employed it could 
be put to work at the normal wage of w*, this representing its social 
product because there is some long-term project (e.g. building a road) 
whose return is the same in slump as in boom. Then assume that policy 
can either stabilise employment (output) in face of shocks entirely at L* 
or allow employment to vary to L\ on the upside and Lo on the downside. 
The extra employment has social utility, since it gets people closer to the 
optimum, L + . We can measure the social loss as w*{L + — L). 

We now notice from figure 5.2 that an upward shock has less absolute 
effect on employment than a downward shock because the supply curve 
gets progressively flatter as wages fall, getting closer and closer to the 
benefit level. This implies that as the economy fluctuates the average 
level of employment falls. It follows that the social loss is on average 
greater too. So plainly fluctuations lower welfare. 

Policy makers would of course like to keep the economy at L + , min¬ 
imising fluctuations around this point. However, stabilization policies 
which attempt to boost L over L* (y above y*) systematically will be 
frustrated in the attempt and will create other problems for the econ¬ 
omy (inflation in the case of monetary policy and high taxation in the 
case of fiscal policy). Hence var {y — y*) is the minimand subject to 
the constraint that stabilization policies are not on average biased away 
from y*. 

To sum up, in rational expectations models, there are prima facie 
reasons to believe that in general stabilization policy, even if effective, 
will not improve welfare, the main exception being where distortions are 
correlated with the cycle; the importance of this exception will vary with 
the nature of the benefit system. 


2 This situation is not one of full Pareto-optimality, as markets are incomplete and 
distortions assumed to exist; however, it is a situation of restricted Pareto-optimality 
(second-best), under the provisos given. 
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Figure 5.2: Benefit distortions and stabilization 


This fundamental argument for stabilization policy is in principle 
also an argument for a change in the benefit system. Of course, if for 
some reason the benefit system cannot in practice be changed, then 
stabilization policy is justified on the ‘second best’ grounds we have 
described. But this does not imply that one should uncritically accept 
the system and the costs it incurs, even with stabilization policies to 
mitigate them. 


OPTIMAL ECONOMIC POLICIES AND TIME INCON¬ 
SISTENCY 

A crucial feature of an economy with rational expectations is that current 
outcomes and the movement of the system’s state depend in part upon 
anticipated future policy decisions. 

The same feature will obviously not occur when expectations are 
formed in a backward-looking manner, since in these circumstances cur¬ 
rent outcomes and the movement of the system’s state depend only upon 
current and past policy decisions and upon the current state: they need 
to allow for the full effects on private behaviour of people’s expectations 
about the outcomes of the rules (we discuss this issue, often known as 
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Lucas’ Critique, in chapter 6). However, having computed the rules ap¬ 
propriately, policy-makers face a further difficulty — or opportunity — 
if they are not bound in some way to stick to the rule they so compute 
but people expect them to stick to it nevertheless. For after one period 
of operating this rule, people have made certain commitments, expecting 
the rule to prevail; the policy maker now faces a new situation, in that 
he can exploit these commitments in making policy afresh. Such policy 
behaviour is known as ‘time-inconsistency’. 

Of course, matters will not rest there, since the people, thus tricked 
and exploited, will be less willing to believe that the fresh policy rule 
will prevail without yet further twists. There is also the question of 
whether people would have been willing to believe the original policy 
rule in the first place if the policy-makers could wriggle out of it. Finally, 
we may ask why policy-makers should wish to trick people in this way 
‘for the social good’ — on the face of it, it is peculiar that social good 
can be achieved by trickery. However, we defer these developments of 
the argument until after an explanation of the mechanism behind time- 
inconsistency. 

Kydland and Prescott (1977) showed how, in a dynamic economic 
system in which agents possess rational expectations, the optimal policy 
at time t = 0, which sets future paths for the control variables (taxes, 
subsidies, monetary growth), implies values of the control variables at 
some later time t+i, that will not be optimal when policy is re-examined 
at t + i, even though the preferences of agents are unchanged. This they 
called the time inconsistency of optimal plans. 

Two examples (given by Kydland and Prescott, 1977; Fischer, 1980b) 
relate to examinations and patent policy. Optimal policy at the begin¬ 
ning of a course is to plan to have a mid-session exam. However, on 
the morning of the exam when all student preparation is complete, the 
optimal policy is to cancel the exam, saving the students the trouble 
of writing and the lecturer the trouble of grading. Optimal policy may 
analogously be to withdraw patent protection after resources have been 
allocated to successful inventive activity on the basis of continued patent 
protection. 

The argument’s formal structure can be seen in a simple two-period 
problem presented by Kydland and Prescott. Let 7r = (7Ti, 7^) be a 
sequence of policies for periods 1 and 2 and x = (x\, X 2 ) be the corre¬ 
sponding sequence for economic agents’ decisions. The social objective 
function is given by 

S[x i,x 2 ,7ri,7r 2 ) (10) 

Consider first the situation with backward-looking expectations, wh¬ 
ere no time inconsistency arises. For simplicity we will assume an absence 
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of stochastic error terms, which does not affect the argument. In this 
case the model is: 


Xi = X\ [lX\ ) 

(11) 

x 2 = a; 2 (xi,7r 2 ) 

(12) 


The policy maker maximizes (1) subject to this model, meeting the 
first order conditions of a maximum (we assume the S function is well- 
behaved so that second-order conditions are met): 

Q= dS_ = ( 8 S_ 8 S_ dx 2 \ daq 8 S_ 

cbr-j yctei 8x2 8x\) 8 tx± dni 

n _ _dS_ _ dS_ dx2_ , dS_ 
d'K 2 dx2 ’ d-K2 ' 9tz2 

The solution of (2) to (5) defines the optimal values of 7ri and 7r2 (and 
the implied x\ and X 2 ) which the policy maker chooses at the beginning 
of period 1. At the end of period 1, x\ and 7Ti have occurred, exactly 
as the model predicted and the policy maker intended, respectively. So 
at this point, were he to recompute his optimal second period policy 7r 2 , 
he would simply solve (3) and (5) for 112 and X 2 - Since x\ and 7Ti are 
as implied by the complete four-equation solution earlier, the truncated 
solution for 1 x 2 and X 2 conditional on them will also be the same as 
before (any reader doubtful of this should check this result with any set 
of four simultaneous equations). So the policy maker will automatically 
continue with his planned policy under the case of backward-looking 
expectations. 

Now consider how rational expectations affect the situation, under 
the (artificial) assumption that people believe the policy-maker’s an¬ 
nounced plans. The model now changes because the outcome in period 
1 is affected by expectations of period 2 policy 

xi = £1(71-1 , 7t 2 ) (14) 


x 2 = x 2 (x 1 ,tx 2 ) 


(15) 


The policy maker’s maximization at the start of period 1 is now given 

by 


as _ 

f 8 S 

8 S 8 x 2 ^ 

1 8 x 1 

8 S 

1 _ 

8tx\ 

V&ci 

8 x 2 8 x 1 ) 

' 8ix 1 

8 ix\ 


(16) 


dS__dS_dx 2 dS_ 8 S_ dS_dx 2 &ci 
81 x 2 dx 2 dix 2 81 x 2 dx\ 8 x 2 dx\ 81 x 2 


(17) 
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Equation (8) is the same as (4) under adaptive expectations, but (9) 
adds to (5) the last term, which reflects the public’s rational anticipation 
of 7T2 and its effect on period 1 events, X\. The policy maker obviously 
wants that effect to be optimal. 

At the end of period 1, when X\ and -k\ have become history, the 
policy maker can re-optimize his setting of 7T2 and, if he does so, his 
appropriate first-order condition is: 

8S = 9S^ as 

dlT2 dX2'0 tT 2 dlT2 

which reverts to (5). In other words, having influenced expectations in 
period 1, he now can neglect any effect of 7 t 2 on the x\ outcome (just as 
was the case throughout with adaptive expectations). He solves for 7r2, 
X 2 from (7) and (10), given m, X\ (and previously planned 7T2, X 2 ) from 
(6) to (9). 

Obviously the solution for 7T2, X 2 will be different from the previous 
plan (it would be the same if and only if (10) were identical to (9)). 
Hence his optimal plan is necessarily time inconsistent. 

In macroeconomic policy time inconsistency arises because of the 
desire among policy makers to exploit the Phillips curve short-run trade¬ 
off between output and inflation. If they merely carry forward the money 
supply plans everyone expects, output will be at the natural rate, y* (as 
will unemployment). However, they would ideally prefer higher output 
(as stressed by Hillier and Malcolmson, 1984) because of the presence 
of distortions (such as unemployment benefits discussed earlier) which 
mean that private decisions generate too low an output and employment. 
So it is optimal to raise money supply growth (and inflation) above its 
expected level to stimulate output. 

Take the following example, using a heavily simplified macro model; 
the example originally comes from Barro and Gordon (1983). Let the 
government set money supply growth, g tl and assume that inflation is 
equal to this (one can think of this as the government using the money 
supply to target inflation directly). Output is determined by the Phillips 
curve: 


yt = y* + 0(gt - gt) (19) 

is people’s rational expectation of money supply growth formed before 
the t-period opens. 

The government’s (and also the social) utility function is: 

Ut = ~{yt — ky*) 2 — s(gt — g*) 2 (k > 1) (20) 

ky*(> y*) and g* are the government’s ideals for output and inflation, 
respectively. 
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Suppose the government announces its planned money growth in t— 1; 
it is believed and therefore equals g\ r. But then it is able to change its 
mind at the beginning of t, after has been set, so that a different g t 
can be delivered. The set-up is quite artificial, as we pointed out earlier 
and will shortly discuss. 

Before the period the government maximizes U t with respect to g t , 
subject to the Phillips curve and g t = g\. In this case y t = y* and the 
maximum is: 

0 = ^ = -2 s(g t -g*) (21) 

og t 

whence g t = g* ■ 

However, once period t has begun, the government can re-optimize, 
maximizing U t with respect to g t again subject the Phillips curve, but 
now given gf = g*. Substituting for y t from (12) in (11) gives: 

Ut = -[(1 - k)y* + 9(g t - g*)] 2 - s{g t - g *) 2 (22) 


Maximizing this we get: 

3U 

0 — = -2[(1 - k)y* + d(g t - g*)}9 - 2s(g t - g*) 

ogt 

so that 


^ i 

9t = g + 


0(k-l)y* 
s + 9 2 


(23) 


(24) 


Actual money supply growth is more stimulatory than expected as 
the government exploits the Phillips curve. Figure 5.3 illustrates this. 

The concentric ellipses in figure 5.3 are the government’s (society’s) 
indifference curves, with B denoting its ‘bliss point’. PlPl, P s P s are 
respectively the long-run Phillips curve and the short-run, conditional 
on gf = g* (we will return to P' S P' S and point c). Point a is the optimal 
plan before period t opens. Point b is the optimal action after it opens. 

As our heavy earlier hints have indicated, there is an artificiality in 
the assumptions which permit the government to fool rational private 
decision makers in this way. Even though it is for the general good, 
private individuals will try to avoid being fooled in this way, because as 
individuals that is their best decision. That is precisely what a distortion 
involves: the outcome of best private plans does not coincide with the 
social optimum. 

Another way of stating this is to say that individuals will, in forming 
rational expectations, take account of the government’s incentive not 
to carry through its announced plan. If people know the model, then 
they know the government’s utility function and that it will maximize it. 
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Figure 5.3: Time-inconsistent exploitation of the Phillips curve 


They will be in a position to work out what the government will actually 
do as opposed to what it says it will do. 

In our example, this has radical results. People form their expecta¬ 
tions, gf, by working back from what the government will do in period 
t given those expectations. The government will maximize U t subject 
to the Phillips curve and gf whatever it is. Substituting (11) into (12) 
gives: 

Ut = -[(1 - k)y* + 6{g t - gt)f s[g t - g*} 2 (25) 


and setting = 0 solves for: 


9t = 


sg * +0 2 g e t + 9(k - 1 )y* 
s + 9 2 


(26) 


This is the rule people know the government will actually follow, re¬ 
gardless of its pious intentions. Therefore to form a rational expectation 
in period t— 1 people take expectations of (18): in so doing they ensure 
that gl = E t -igt- Therefore they solve: 


Et-igt 


sg * +6 2 E t -ig t + 9{k - 1 )y* 
s + e 2 


(27) 


from which it emerges that: 


9t = Et-igt = g* + 


0(k - 1 )y* 


s 


(28) 
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When the government comes to execute its optimal rule (18), what¬ 
ever it may have announced in advance, it will find that it will generate 
money supply growth exactly as people expected, g t = gl as in (20). 
This can be established by substituting (20) for g\ into (18) and solving 
for g t . 

In figure 5.3, this is point c, where y t = y* on the long run Phillips 
curve. Yet the government has reached a tangency point of an indiffer¬ 
ence curve to the short run Phillips curve, P' s P' s . through that point. 

What we see here is a combination of two factors. First, rational 
expectations is ensuring that in the absence of shocks people are not 
surprised so that their private output plans are not frustrated: y t = y* 
on the long-run Phillips curve and gt = E t ~igt- This is a restoration of 
Sargent and Wallace’s principle of policy ineffectiveness in the context 
of a standard Phillips curve. 

Secondly, the government is pursuing a particularly destabilizing rule 
for inflation. Instead of responding to past output, itself a rule that 
destabilizes money growth and inflation in the presence of shocks (but 
not here where shocks have been assumed away), it is responding to ex¬ 
pectations of inflation in the attempt to stimulate output above its nat¬ 
ural (inevitable) level. In other words, whatever money supply growth 
people may expect it to inject in an attempt to stimulate the economy, 
it goes one better in its attempt to reach tangency with the short-run 
Phillips curve. The process is only arrested by the escalating costs of 
inflation. It can be seen from figure 5.3, where the vertical distance 
between points b and c is given by (.k — 1) s ( s %^) , that as the cost °f 
inflation, s, tends to zero, point c tends to infinity. 

Consequently, on top of any moderate destabilization of inflation that 
may occur if it also attempts to respond to past output, the government 
may produce a very large destabilization of inflation through this type of 
response. The solution forced by the conflict between the government’s 
determination to reach tangency with the short-run Phillips curve and 
the people’s rational expectations is the point c where both are finally 
satisfied. 

In this case the ‘time-inconsistency’ of the government’s optimal 
plans has been marginalized by people’s rational expectations, because 
any ‘plans’ the government may form before period t opens are irrele¬ 
vant. The only ‘plan’ of any interest is the computed outcome of the 
government’s true rule. If we refer to this as the government’s true plan, 
then we can say that it is time consistent, because it will be followed 
through in period t. 

Clearly, this time-consistent result of the interaction between the 
government’s rule and people’s rational expectations delivers poor social 
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utility. If somehow the government could achieve point a (the result of 
its original time-inconsistent optimal plan, had it been carried forward), 
that would be better, since it would achieve lower inflation together with 
y t = y* as in c. Point b is better still but no longer available because of 
rational expectations. 

In terms of the general two-period problem set out in the model of 
equations (1), (6) and (7), the equivalent of point a is the solution of 
equations (8) and (9) together with the model where the government 
determines its optimal plan assuming its own credibility. The equivalent 
of point b is the solution of (10) and (7) given x\ and tt\ as determined 
by this original optimal plan and the full model — here it ‘backslides’. 
The equivalent of point c is given by the joint solution of (10) and (18) 
with the model, where its future backsliding is fully taken into account in 
its decision for first-period policy and in the behaviour of private agents. 

We have now exposed the artificiality of our original assumption that 
people automatically believe the plans the government announced at the 
start of the first period. In fact they will only believe them if the govern¬ 
ment commits itself in advance to carry them out, on pain of a penalty 
at least as great as any gain from backsliding. This penalty has to be ad¬ 
ministered by some agency outside the government’s control: otherwise 
the government could abrogate the penalty. This incidentally creates a 
difficulty for a sovereign democratic government: if it is sovereign, may 
it not refuse to submit to any penalty, even one it agreed to? Clearly if it 
does not surrender its sovereignty over this matter then it cannot enjoy a 
point such as a in figure 5.3. Without an effective and sufficient penalty, 
the only possible point is c: not only b but also a is ruled out. (Point c, 
the ‘time-consistent’ policy case in this literature, would in the literature 
of contracts be called the ‘self-enforcing’ or ‘incentive-compatible’ case. 
In effect if no contract can be enforced against the interests of one party, 
then the only possible contract is one that this party will carry out any¬ 
way — Hart, 1983. The parallel here in policy is that no enforceable 
contract specifying a penalty can be drawn up with the policy maker.) 

Since a is preferable to c, it is in the interest of policy-makers to find 
a mechanism for effective commitment. Various mechanisms have been 
suggested. The most widely canvassed is a constitutionally independent 
central bank such as the Bundesbank (set up after the Second World 
War) in Germany and the Federal Reserve Board in the USA; we shall 
explore this carefully below. Another is to join a currency board system 
of fixed exchange rates (as in the sterling area) whereby one’s money 
supply is controlled by the dominant currency of the system: the Euro¬ 
pean Monetary System evolved into the euro-area partly for this reason. 
In the case where the government itself controls its own money supply 
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within a floating exchange rate system, the British government since the 
Second World War being one example, it may seek to create political 
embarrassment from backsliding. 

The Thatcher government’s Medium Term Financial Strategy - 
MTFS — was of this type. It set targets for the money supply and 
public sector deficit for a rolling four-year period: these targets and the 
associated commitments to reduce inflation were emphasized by the gov¬ 
ernment as a litmus test of its fitness to govern. The implication was 
that the public should not re-elect it if it failed: clearly a dangerous 
commitment to fail on from a political viewpoint. 

These mechanisms all have the key deficiency that governments are 
sovereign and cannot be bound by past commitments; therefore they will 
naturally re-optimise and whatever hurdles are put in their way they will 
contrive to leap over them — thus the time-inconsistency problem reap¬ 
pears (McCallum, 1995). Democratic governments for example have the 
power to change constitutions or to leave currency boards. They can 
plainly run public sector deficits, which ultimately may drive them to 
print money to avoid the rising cost of debt finance: central bank ob¬ 
struction in these circumstances will be over-ruled, with the ultimate 
threat of removing the bank’s independence. The Thatcher MTFS con¬ 
fronted the issue of deficits, and by making the public its judge also 
avoided the constitutional issue. The question still remained over how 
far it could get away with backsliding: for example, could it not have 
tried to persuade the public at an election that somewhat higher inflation 
then planned was for their good because it allowed lower unemployment 
(point b in figure 5.3)? 

This deficiency implies that there is with each of these mechanisms 
some probability of effectiveness, some of backsliding; let us leave on one 
side for now where these probabilities might come from and how they 
might be assessed. The resulting solution will reflect these probabilities. 
Let us pursue this in the context of our inflation-unemployment trade-off 
example. 

Suppose the government commits incompletely to a plan of g t = g* so 
that there is a probability, 7r, of its pursing a backsliding policy g\ (that 
is, maximizing Ut given whatever expectations, g%, have been formed on 
the basis of its ‘commitment’). The consequential expectations, g ^, will 
be: 


9t = ^9t + (1 - ir)g* (29) 

To calculate g\, we substitute for g\ in U t obtaining 
U t = -[(1 - k)y* + 6{g t - irg b t - (1 - ir)g*)} 2 - s(g t - g*) 2 (30) 
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so that its backsliding strategy is given by setting 0 = which yields: 

6 * , e(k-l)y* 


9t = 9 + 


S + 8 2 ( 1 — 7T) 


(Our procedure, it will be recalled, is to maximize with respect to gt, 
given then to set g\ = in the resulting first-order condition, and 
solve for g\.) 

It follows that: 



n9(k — 1 )y* 
s + 6 2 { 1 — 7r) 


(32) 


The resulting situation is illustrated in Figure 5.3 by the Phillips 
curve P s ’P s ’and is worked out in Box 5.1. The outcome will either be d 
if the government backslides or e if it does not. Point d is clearly better 
than c (it may or may not be worse than a, depending on the size of 
s, the government’s dislike of inflation and the extent of credibility in 
the uncertain case); backsliding when you have uncertain commitment 
is better than when you have none, but you might have been better off 
with totally certain commitment. 


Box 5.1 

THE VIRTUE OF (EVEN SOME) CREDIBILITY 

When credibility is total, then 

9t = = 9* 

and 

U t = -[(1 - k)y*] 2 = U a 

i.e. utility in case a in Fig.5.3. 

When there is no credibility (case c), we obtain 

d2 

U c = U a ( 1 + -) 
s 

After laborious substitutions we obtain for the backsliding case d 
under imperfect credibility 

u d = u a {— S ^ S 2 + 0 ^ -) 

V (s + 0 2 [1-7t]) 2; 
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and in case e (follow-tlirough with g*), 


U e = U a ( 


s + 9 2 [1 — 7r] 


) 2 


Comparing U e with U c and Ud with U a , we obtain in both cases an 
ambiguous ranking: 


U 1= Ud = s(s + 9 2 ) 

Uc Ua (s + 0 2 [1 — 7r]) 2 


Subtracting the denominator from the numerator yields: 


<9 2 [s(27t-1)-<9 4 (1-7t) 2 ] 


so that U e y U c and U ( j y U a if for example 7r is high or if 7r >- 0.5 
and s is high or 9 is low (it requires that n y 0.5). 

The probability-weighted average of Ud and U e is 

U w = 7 rU d + (1 - n)U e = U a ( S +f ) 

s + 9 [1 — 7rJ 

which is worse than U a and better than U e . 


Point e, following through on uncertain commitment, though clearly 
worse than a and d, may or may not be greater than c. However the 
expected (probability-weighted) utility of points d and e is better than c 
and worse than a. It follows that some commitment, however uncertain, 
is better than no commitment at all, because it gets the government on 
to a lower Phillips curve, so a better trade-off: expectations of money 
supply growth, gf, are reduced. 

Uncertainty about whether the government will backslide for some 
reason is unavoidable given its sovereignty. The essential source of the 
uncertainty is whether the penalty triggered by backsliding (over-riding 
the central bank, being politically embarrassed by higher inflation, etc.) 
will be bigger than the gain from it (lower unemployment, higher output 
at the expense of somewhat higher inflation). 

As we have seen, it pays the government to make a tough commitment 
to g*. But no government can bind its successor, or even itself at a future 
date. If its preferences change (a new government or new ministers), or 
if circumstances change (an unexpected fall in y*, for example), the 
penalty may not be sufficient. 

Modelling this uncertainty is clearly difficult. Various attempts have 
been made. One, initiated by Backus and Driffill (1985 a,b), has focused 
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on uncertainty about government preferences. The government may turn 
out to be ‘wet’ or ‘dry’ (respectively caring about both inflation and 
unemployment, and caring only about inflation): the probability, n, of 
backsliding (the effect of ‘wetness’) is constantly re-assessed in the light 
of policy actions. 

The difficulty with this approach is to motivate, first, these differences 
in preferences and, second, the private sector’s lack of knowledge about 
them. If we probe more deeply into the basis of politics — we do more 
on these lines in Chapter 8 below — we can see that parties wish to win 
votes, by appealing to popular preferences (those of the median voter), 
and they may also have ‘ideology’ (that is beliefs about what should be 
done allied to the belief that they can make this attractive to voters). 
Possibly we could define a wet government as one that wished to appeal 
to the median voter; a dry one as one that ideologically favoured zero 
inflation regardless of short-term popularity. Yet this raises the question: 
if voters truly reward reflationary time-inconsistency at the polls, would 
it be rational for a government to be dry, so guaranteeing loss of office? 

Box 5.2 

UNKNOWN PREFERENCES AND REPUTATION 
— A SIMPLE EXAMPLE 

A government gets elected for a two-period term (a ‘period’ lasts two 
years). There are two sorts of politician: ‘wet’ with utility function 
(seen from the start of any term of government they might have) 

U w =~n*+ a(y t -y)+ <5[--tt? +1 + a(y t+1 - y)\ 
and ‘dry’ with utility function 

Ud = ~ 1 

where in both cases 6 >- 1 (because the second period is closest to 
the next election). The economy consists of a Phillips Curve 

y t = b{ 7 T t - 7 r t e ) 

and 7 rf is formed rationally with information at t — 1 (the log of the 
natural rate, y*, has been normalised at 0). The private sector is 
assumed to know nothing at all about the politicians’ preferences, 
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so that the initial probability of the government being wet, prob w = 
0.5; it then looks for clues from government behaviour. 

The dry-politician government would maximize its utility = 

= 0) by setting n t = n t +i = 0. The wet-politician government 
faces a choice. If it chooses nt >- 0 in t, then the private sector 
will know in forming period t + 1 expectations that the government 
is wet ( prob w = 1); if it chooses 7r t = 0, the private sector will 
still be in the dark and prob w = 0.5 still, in period t + 1 it no 
longer matters whether the private sector knows because the period 
after the election (t + 2) does not count: politicians only care about 
winning the election. Hence maximizing U w with respect to 7r t+1 
yields simply 

0 = 7t~ - = -7r t+ i + abornt+i = ab 
OTTt+l 

Maximizing U w with respect to nt involves choosing the highest of 
two utilities, cases A and B: 

(A) if 7r t = 0: = —abn% — ay + <5[— \[ab) 2 + ab{ab — 7r^) — ay] 

(B) if 7r t y 0, then the wet government will choose the nt that 

maximizes u w , that is 0 = = —nt+ab so nt = ab (remembering 

that 7Tj, 1 will be invariant to nt if 7r* >- 0 .) Its utility will then be: 

Uw = ~^( ab ) 2 + ab (ab - O - ay + 6{-^{ab) 2 + ab{ab - n^) - ay] 

Subtracting from gives us: 

Uw ~U* = ~{ab) 2 + {ab) 2 - 6ab{n e t £ - 7r^) 

which clearly depends on private expectations formed in t for t + 1. 
In case A, people will still not know the government is wet and so 
7r^ = 0.5a6; in case B people will know and so n f t +\ = ab. So 

Uw - U£ = \{ab) 2 - 8ab{\ab) = \{ab) 2 { 1 - 6) A 0 

Hence it pays a wet government to pretend it is dry in period t: 

7Tt = 0, so ‘building reputation’ - before exploiting its reputation in 
the run-up to the election with a pre-election reflation. We can now 
incidentally work out nf . Since people know that a wet government 
would choose this strategy, of zero inflation in the first period, there¬ 
fore n\ = 0; both governments would therefore choose zero inflation, 
leaving people no way of distinguishing when forming expectations 
for the second period. 
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This is known as a situation of ‘pooling equilibrium’ in period t 
because both types of politician choose the same, ‘pooled’, outcome. 
However, the set-up assumes that the dry government is indifferent 
to being indistinguishable from a wet government in the first period. 
This is true above because the dry government’s utility is unaffected 
by yt+i and so by nf +1 . 

Yet this is improbable since in winning the next election the dry 
government would be helped by lower unemployment and so by a 
lower 7 T( + 1 . We can allow for this by letting its preferences now be: 

Ud = Aft + + a(y t+1 - y)] 

where a -< a 

In this, much more complicated, case the dry government also has 
a choice: it can choose 7 iy = 0 as before, knowing a wet government 
would match it or it can choose a 7 r t that is so negative in the first 
period that a wet government would find it too expensive to pre¬ 
tend and match it. If the dry government were to choose the latter 
strategy, we would have a ‘separating equilibrium’ where the wet 
governmentr does not pretend and hence cannot ‘build reputation’; 
instead the private sector knows at once what sort of government it 
faces. 

Consider the possibilities for the dry government: 

(i) It chooses TT t = 0. By so doing it permits a wet government to 
pretend, as we know from the above steps it would. 

(ii) The dry government chooses 7 iy -< 0, such that the wet govern¬ 
ment would find it not worth pretending. To find out how negative 
TT t , would have to be we check out the critical n t just above which 
the wet government gives up pretending, call this n*. 7 rjf is such 
that U£ = U£ 

So if 7 T wt = 7 Tj, then 7 r^ = ^ab and 

Uw = -^(O 2 + ab('K* t -it e t ) - ay + 6[~(ab) 2 + ab{ab - ^ab) - ay] 

If 7 v wt = ab , then 7r^ = ab and 

u w = -]^{ a b) 2 + ab{ab --K e t ) - ay + 6[-^(ab) 2 - ay] 

Setting U^ and cancelling terms gives us: 

7 r* 2 - 2abn* t - ( 6 - l)(a&) 2 = 0 
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the unique negative solution of which is: 7rjf = ab(l — VS) 

We now ask whether it would pay the dry government to choose this, 
so forcing the wet government to stop pretending, or instead choose 
7T t = 0, so allowing it to pretend (for the dry to choose anything 
in between would be the worst of both worlds since it would suffer 
from negative inflation as well as permitting pretence). We compare 
(7r t = 0) with U£ (n t = 7rjf), so that 77^ = \ab and 77^ = 0. 
We obtain: 

U* d -Ui = \{abf[^-{l-V6f] 

Considering this problem we can see at once that we require 8 >- 
1 and a y 0 for a separating equilibrium to be possible. 8 = 1 
implies 7rjf = 0: a = 0 implies that >- U ( f. Fig. 5.4 below 
shows the conditions (for 8 >- 1) under which we get a separating 
equilibrium: where — >- • For example, assuming 8 = 2 (the 

pre-election performance is twice as important for the election as 
the post-election one), then a needs to be only 9% of a; so the dry 
government needs to care about pre-election unemployment only a 
very little relative to a wet one. 



Figure 5.4: The Backus-Driffill model 
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As for revelation of preferences, Vickers(1986) showed that in many 
circumstances it would pay the dry government to signal its nature by 
some extreme act of dryness that a wet government would find too costly 
to imitate. In this case there would be a ‘separating equilibrium’ in which 
wage bargainers would know exactly what sort of government they faced. 
This question is explored in the example in Box 5.2. 

Another approach, initiated by Barro and Gordon (1983), assumes 
that preferences are known and constant and that the penalty for back¬ 
sliding is a ‘loss of reputation’, whereby the private sector (wage bargain¬ 
ers) disbelieve the government’s announced plans for a certain period of 
time and assume the government will backslide during this period. The 
penalty is arbitrary in the Barro and Gordon model but we could reduce 
the arbitrariness by assuming for example that private disbelief contin¬ 
ues until there is a change of membership in the government (e.g. an 
election cycle). 

During the period of lost reputation, the government is considered 
to suffer the point c. This penalty is in general less than the gain to 
backsliding on g* as an announced plan. People work out the announced 
plan (gl‘ > g*) such that the penalty is just sufficient to offset any gain 
from backsliding. This is of course the plan that the goverment will also 
choose. It produces a point on the vertical Phillips curve between a and 
c. This is worked out in Box 5.3. 


Box 5.3 

REPUTATION THROUGH PRIVATE SECTOR 
WITHDRAWAL OF BELIEF 

The government has an infinite-horizon utility function at t: 

OO 

u t = 6t {-* n t+i + a (vt+i - y)} s -< 1 

2 — 0 

and faces a Phillips Curve 

Vt = b(n t ~ O 

TTf is formed rationally at t— 1 for t according to the following agreed 
private sector one-period punishment strategy: 

(1) if 7Tt-i >~ 7Tt_i; 7r t — a b (the government’s discretionary chocie) 

(2) if 7r t _i = 7r®_i,7r® = 7r“, acredible government target announced 
in t — 1 for t, t for t + 1 etc (provisionally assumed constant). 
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Under (1) the government ‘forfeits credibility’, under (2) it ‘achieves 
credibility’. 

We start this game off by assuming that 7T_i = n e _i. 

Knowing the private strategy the government chooses a plan at —1 
for t > 0. It has two basic strategies to compare: 

(a) It can announce and deliver 7r“ throughout - a permanent follow- 
through strategy. 

(b) It can cheat in even periods and suffer punishment in odd periods 
(cheating in odd ones must be worse than this because the gains 
are delayed). When it cheats it will choose n = ab because this 
maximizes its (cheating) utility as we have seen. 

Under (a) its utility is: 


Ut= j ~ ay\ 


Under (b) it is: 


OO 1 oo 

u t = P{-y(ab) 2 +ab{ab-n a )-ay}+ ^ 8 l {--(abf - ay 


i(even) 

Remembering that 


i(even) 

and analogously 


oo oo i 

V V = i ~r~b' ■ ... v s 2i = -— 


£ 4 ‘ = r 

i(odd) 


we obtain 


Y ~^2 [^( o6 ) 2 ( 1 -S)-{1 + 6)ay - abn a ] 


whence (remembering = (1 _ g) 1 (1+g) ) 


U? - U b 


2(1 - 6 ) 


2ab _a , C 1 

T+8 + (TT6) (a6) } 


The situation is illustrated by Figure 5.5 which shows U a and U b 
as functions of 7r°, the credible target. U a = U b at two points: n a = 
0i> i+g^ ’ ab (H lese are found as the solutions to the quadratic in 
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7 T a that comes from setting U a = U b ). Any point to the left of the 
first is not credible because it would pay the government to cheat. 
Any point between the first and second provides less utility than 
the first and so will not be chosen by the government. Therefore the 
first point, is the only feasible one: at it the government will 

never cheat, because the loss of utility from losing credibility next 
period just offsets the gain from cheating. Plainly 7r° is a constant 
as assumed (as can be checked by seeing that this holds in every 
period). 



Figure 5.5: The Barro-Gordon model of credibility 

This idea on its own simply works out what would happen if the 
penalty for backsliding on a plan were less than the penalty needed to 
prevent backsliding on g*. The result is a steady gf = g t > g*. The 
government would in fact be better off if it could somehow persuade 
people to disbelieve it for longer! 

Uncertainty of a sort can be introduced by introducing shocks to the 
economy’s natural rate or to the government’s preferences which cause 
the government to choose a new g | equilibrium, after initially reneging. 
As we have seen it will always pay the government to choose a permanent 
non-punishment equilibrium as soon as it can. If g t > , the government 

loses credibility: future g t defaults to point c until credibility is restored. 
It then reverts to gt = gf > g* until the next shock producing g t > g ^. 
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The difficulty with this approach is how to define and motivate a co¬ 
ordinated, non-arbitrary strategy on the part of private agents for ‘trig¬ 
gering’ belief or disbelief — their ‘punishment strategy’. These agents, 
it must be remembered, are not voters here, but wage bargainers. Why 
should they collectively come together to agree a punishment strategy 
instead of getting on with their own private business? One can think of 
reasons if workers are represented by a single union or a well-integrated 
set of unions; and this case has been explored by Herrendorf and Neu¬ 
mann (1998). But in general the case is unconvincing. 

We have therefore reviewed two approaches to uncertainty — one 
emphasizes the uncertainty of government preferences, the other that of 
the changing environment. Both are attempts to model the evolution of 
credibility, viewed as the possible selection by the government of either 
low inflation or reflationary policy. However both have arbitrary ele¬ 
ments in them which make them unconvincing in general as an account 
of why governments choose to inflate or not. 

More recently economists have returned to the original Kydland- 
Prescott model and attempted to embed it in a rational political process. 
There have been two main strands of thought. 

The first is an institutional model of politics. In this approach it is 
assumed that institutions matter and do in practice constrain govern¬ 
ments, including democratic ones. So it is accepted here that govern¬ 
ments will be prone to time-inconsistency, implicitly because this will 
appeal to voters whose preferences are such as in the Barro-Gordon set¬ 
up. But there are difficulties for governments in overturning institutional 
barriers — they are for example seen as ‘high-handed’ or ‘arbitrary’ and 
therefore governments like to be seen as acting within the conventions; 
even if they could overthrow them, they will not do so lightly because it 
requires costly legislative time and ministerial energy. 

These authors (the earliest of whom is Rogoff, 1985) suggest that the 
electorate may well accept some institutional reform, if it can be shown 
to have good long-term effects. Then the reform once made it will be 
respected by future governments. They suggest therefore establishing 
the institution of an independent central bank, with a remit (or ‘insti¬ 
tutionalized preferences’ which could be summarised in the degree of 
‘inflation toughness’ of the Governor) to give more attention to inflation 
than would be given by politicians reflecting popular preferences. 

The earlier work gave no real role to stabilization since there were no 
shocks shifting the Phillips curve itself; only demand shocks introduced 
by monetary policy and inflation and causing output to move up and 
down the Curve. In this revival of the idea of Bank independence Phillips 
curve shocks (which could come from supply or from demand via other 
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elements entering the curve, such as the exchange rate or real interest 
rates driving inter-temporal substitution) were introduced so that the 
government has another reason for varying inflation besides the desire 
to raise output above the natural rate: to stabilize the economy in the 
face of shocks. 

The resulting set-up is illustrated in figure 5.6 (the same as figure 5.5 
above, except that the Phillips curve is now stochastic). The voter, as 
represented by the government, has ‘normal’ preferences with a dislike 
of both inflation and unemployment. Unrestrained, these would produce 
the poor time-consistent outcome c on our original figure — high average 
inflation; however it would also stabilize shocks as shown at points / and 
g. Reflecting on their institutions however voters realize that this sys¬ 
tematic result of the political process is not as good as could be achieved 
if institutional restraint was exercised through an independent central 
bank that was free to react to shocks and yet limited in its tolerance 
of inflation. (One might ask here why not look at other constitutional 
restraints? The answer would be that a purely constitutional law would 
lack the necessary flexibility Since we are setting the institution up for 
the long term we are concerned about its effect on expected (average) 
future social (i.e. the voters’) utility. We vary a until this is maximized; 
it turns out (see Box 5.4) that this occurs somewhere between 0 (total 
intolerance of inflation) and popular preferences. Inflation would be zero 
at the first but stabilization would also be nil; at the second the inflation 
bias would be at its worst but stabilization would be at its most effective. 
A compromise is necessary. 

Box: 5.4 

THE INSTITUTIONAL APPROACH TO THE IN¬ 
DEPENDENT CENTRAL BANK 

Suppose society has the utility function: 

u s = +o‘ivt -y) 2 } 

and faces a Phillips Curve 

Vt = c{ir t - 7 T e t ) + u t 

where Ut consists of shocks on the Phillips curve from supply and 

also demand (via real interest rates or exchange rates which enter the 
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inflation/labour supply process) with variance er 2 . The institutional 
reformer wishes to choose a central bank set of ‘preferences’ (short¬ 
hand for the remit given in the central bank legislation) in order to 
mazimize society’s expected utility (i.e. the average over the future 
- long - period during which the legislation will hold), that is 


EU„ = --[Et$ 


aE(y t - yf 


To motivate the set-up consider two extreme instructions to the 
central bank which we summarise as giving the central bank a set 
of preferences of the form: 

(i) not to care at all about unemployment (a = 0) 

(ii) to behave as if it had the same preferences as society (a = a) 
Using the central bank’s first order conditions, 

under (i) n t = 0 = and so EU S = — ^a[y 2 + a 2 } 

under (ii) n t = ac ■ It follows that 7T® = acy so that 

7r t = acy - and y t = j^iu t so EU S = -\{g 2 (a + a 2 c 2 ) + 

(TT^y 2 } 

Compared with case (i), case (ii) offers a clear reduction in the 
effects of shocks due to the Bank’s active stabilisation response, but 
a rise in the effect of y (the ‘distortionary gap’ between desired 
full employment output and its natural rate), due to the Bank’s 
frustrated attempt to stimulate output beyond its natural rate; this 
latter being the effect on expected utility of the inflation bias. 

We can now generalise the problem and find a compromise Bank 

preference for lower unemployment, letting 0 -< a -< a 

This will produce analogously from the Bank’s first order condition 

ac 

7r t = acy - — - T^ut 

1 + ac- 


and society will enjoy 


EUs ~ 2 ~ J * ' (1 + ac 2 ) 2 " / 

From this we can construct figure 5.7 where we show how EU S varies 
with a. As a —> a, the effect of the shocks falls as stabilisation rises 
but the inflation bias rises too. The two together reach a maximum 
at a. From 0 = d ^ s we find the equation for a is: 


a 2 c 2 ]y 2 


a + a 2 c 2 


a( 1 + ac 2 ) 3 
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which reveals that a rises as a 2 rises relative to y 2 . The reason is 
that the bigger the shocks are relative to the distortionary gap, the 
more the gain to stabilizing relative to the cost in higher inflation 
bias. 

All this assumes the central bank’s remit must be to maximize 
something of the same form as society’s utility. But of course insti¬ 
tutional design need not be restricted to this. Walsh (1995) pointed 
out that one could add a linear term to ub', selected so that when 
this was maximized, inflation bias could be eliminated but optimal 
stabilization could be achieved. Let Ub = + a[y t — y] 2 ) — Xir t 

and set A = acy. The first order conditions now yield: 


7Tt = 


Vt = 

EU S = 


ac 


1 + ac 

1 

T u t 


U t 


1 + ac A 

-i[M7 2 + y 


a 


ac* 


which represents the first best result for society (shown as F on the 
diagram). 

Some other ways of getting the same result are to adjust Ub by 
inserting an inflation target, 7r*, that is artificially low or by setting 
a side-condition that inflation not be varied from its proper target 
except in response to shocks. These results can be extended to 
the more realistic case where output is persistent; in this case the 
optimal contract contains state-contingent restrictions — see the 
Appendix to chapter 11 on dynamic programming. 


Recently it has been realized that institutional preferences 3 can be 


A number of authors have also considered objective functions that are non- 
quadratic in inflation and output deviations. For instance Orphanides and Wilcox 
(1996) consider the opportunistic approach of the central banker. Proponents of this 
approach hold that when inflation is moderate but still above the long-run objective, 
the Central Banker should not take deliberate anti-inflation action, but rather should 
wait for external circumstances - such as favorable supply shocks and unforeseen 
recessions - to deliver the desired reduction in inflation. While waiting for such cir¬ 
cumstances to arise, the Central Banker should aggressively resist incipient increases 
in inflation. The policy maker who is endowed with these preferences gives higher 
weight to stabilizing output when inflation is low, but higher weight to inflation when 
inflation is above target. Such an approach is motivated by the views expressed by 
leading Central Bankers. For instance in testimony before the Senate committee that 
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Figure 5.6: A tough central bank ( a,d,e ) and one with the same preferences 
as society (g, c, f) 


rewritten so that the central bank delivers both zero inflation and maxi¬ 
mum stability. In principle this is easily done (Walsh, 1995) by adding a 


was meeting to consider his nomination to the Federal Reserve Board, former Vice 
Chairman Blinder summarized his views on this issue as follows: 

‘If monetary policy is used to cut our losses on the inflation front when luck runs 
against us, and pocket the gains when good fortune runs our way, we can continue 
to chip way at the already-low inflation rate.’ [Blinder, 1994, p.4]. 

Nobay and Peel (1998) consider the case where Central Bankers have an asymmet¬ 
ric objective function. This is motivated in part by the European Central Bank’s 
announced objectives according to which inflation should not be above 2 per cent, 
but is allowed to be below that level. 

Specification of such asymmetric preferences leads to terms in the variance of infla¬ 
tion or output impacting on expected inflation. In particular there is a deflationary 
bias in the ECB example. Essentially the Central Bank has to shoot for a lower 
expected rate of inflation to allow for the conditional variance of shocks. 

Other authors (see e.g. Beetsma and Jensen, 1997; Briault, Haldane and King, 
1996) have explored the implications of assuming that the Central Banker’s prefer¬ 
ences, though quadratic, are uncertain. In particular the weight on say inflation has 
a stochastic component. Such specifications have implications for expected inflation 
and the efficacy of inflation targets as opposed to inflation taxes (Walsh, 1995) from 
a welfare perspective. 
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Figure 5.7: Choosing the optimal central bank preference parameter (o) 


linear inflation target into the bank’s preferences (it can be thought of as 
forcing an extra linear penalty on inflation); this can then be chosen to 
force the inflation bias to zero while the a parameter is left at the social 
preference value so guaranteeing that stabilization is maximized. There 
are other ways that this can be expressed such as telling the bank to sta¬ 
bilize the economy optimally subject to average inflation (i.e. inflation 
in the absence of shocks) being equal to target. 

This is the institutionalist view. However it is challenged by those 
who argue that it only artificially ‘solves’ the time-inconsistency prob¬ 
lem (McCallum, 1995). It relies on institutional restraint to prevent 
governments doing what suits them. Yet suppose that a government is 
restrained on one occasion by the cost of overriding the constitution; 
will it not find a way of doing it somehow at less cost in the future? 
Popular opinion will be forever straining at the leash demanding that 
its politicians deliver lower unemployment by a ‘little bit of inflation’. 
When politicians are rewarded by popularity will they not in the end do 
whatever is necessary to gain it? 

The institutionalists reply that popular opinion is also in favour of 
the reform when ‘in institutional mode’. Yet this is inconsistent; popular 
opinion cannot be one thing in one mode and another thing when voting 
in elections. Either it wants restraint or it does not; it cannot do both 
at the same time. 

One is driven to reject institutionalism as superficial, and failing to 
address the problem at its roots. The Institution is like the deus ex 
machina with which the Greek dramatists often ended their plays, just 
as a convenient device to end them but without inherent justification. 
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Greek audiences accepted it because they understood wliat it stood for: 
viz a way of giving the audience the moral of the story (delivered by the 
god). We have to probe deeper to see if a similar basis can be found for 
the institutional story. 

To find one, we go to political economy and ask whether it could 
provide a solution. This second strand of thought is a fairly recent reac¬ 
tion to the institutionalist approach (Minford, 1995; Romer and Romer, 
1997). Institutionalism has produced some empirical evidence in its 
favour (for example Cukierman, 1992), showing a correlation between 
the degree of Bank independence and the rate of inflation. This can 
be criticized on two levels. First, the definitions of independence are 
themselves influenced (implicitly if not explicitly) by success in keep¬ 
ing inflation down; the degree of ‘toughness’ (independence) of a central 
bank is necessarily evidenced by inflation itself. The example can be 
given here of the Bundesbank which always scores the most highly on 
measures of independence; however in strict terms it has no real indepen¬ 
dence as it could be nationalised by a simple majority of the lower house 
of parliament (the Bundestag). Its power comes, as board members will 
frequently say, from its support in public opinion. Second, correlation 
cannot prove causation here; it may be that as public opinion becomes 
disenchanted with inflationary policies, it decides to ‘institutionalize’ this 
by setting up an independent central bank. 

The revisionist view is precisely this: that the electorate begins to 
understand the time-inconsistency problem as a potential source of in¬ 
flation, much as an alcoholic understands that his short-term giving-in 
to the craving for the bottle is the cause of his disease. Wishing to en¬ 
sure that inflation is kept low on average in spite of the desirability of 
low unemployment and of stabilizing the economy, it redesigns its own 
voting reflexes to reward politicians appropriately. It will henceforth 
penalize ‘reflation’ whose aim is to stimulate the economy beyond its 
natural rate; but it will reward stabilizing monetary policy, defined as 
that which, while reacting to shocks in a stabilising manner, does not on 
average drive output beyond the natural rate. The electorate does this 
because it recognises that its ‘preferences’ (as expressed in these rewards 
and penalties) are the root cause of political behaviour. 

This view requires that the electorate is coordinated in some way so 
that its punishment strategy can be formulated and exercised. But this 
is not difficult to understand. We speak of the anti-inflationary ‘climate 
of opinion’ in Germany, or in the USA and Britain since the 1970s; what 
we mean is the popular reflexes that, no doubt heavily influenced by 
‘informed opinion’, underlie voting behaviour. 

Furthermore this view leads us to the practical case for an indepen- 
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dent central bank: that it is an efficient low-cost way of organising the 
execution of monetary policy in line with such a public opinion of the 
problem —just as having independent courts is for the law, or privately- 
run (privatised) companies is for business. We can say that it does ‘in¬ 
stitutionalize’ the solution effectively. In other words it tells us how we 
can make sense of the institutionalist case which in itself solves nothing. 
The solution of the time-inconsistency problem is at the basic level of 
public understanding, just as that of alcoholism is at the level of personal 
understanding. Without understanding no solution is possible. With it 
any set-up for monetary policy would in principle work (just as an alco¬ 
holic has many means in principle for stopping drinking to destruction); 
but a central bank is a convenient method in many ways (just as join¬ 
ing a group like Alcoholics Anonymous is for alcoholism). Empirically 
it makes sense of the wide variety of institutions under which countries 
have had both low and high inflation (Roiner and Romer, 1997); an inde¬ 
pendent central bank is neither necessary nor sufficient for low inflation. 
It merely happens to be convenient for it in a democracy. 


CONCLUSIONS 

We began this chapter by asking whether there was any case in principle 
for the government to intervene in the business cycle via variations in 
interest rates or money supply growth. After all private agents have 
access to the same information as the government and their decisions 
would be expected, in the absence of micro distortions, to reflect social 
as well as private costs and benefits. However we identified the crucial 
distortions in the labour market (notably the presence of unemployment 
benefit) that made unemployment non-reflective of social costs; since 
the business cycle will under these conditions give more unemployment 
on the downside than less unemployment on the upside, greater cycle 
variability will produce a higher average distortion. Hence successful 
stabilization policy would reduce damaging unemployment on average. 

We then set out the problem of time inconsistency as a cause of ex¬ 
cessive inflation; the point being that governments, having started out 
with a sensible aim for low inflation, will have an incentive to stimulate 
the economy further than this in order to bring unemployment down 
closer to full employment. People, knowing this incentive, then react by 
raising their inflation expectations to the point where governments just 
have no further incentive to inflate; this of course means that inflation 
is higher than the target while at the same time the economy does no 
better on unemployment — ‘stagflation’. Various models of inflation 
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‘credibility’ have been built on the idea that private agents may react 
to past behaviour in forming views of whether a government will in the 
future succumb to the temptation to ‘reflate’; these models are attempts 
to see whether the natural operation of private markets and reputation 
can reduce this governmental tendency. However these models are unsat¬ 
isfactory in a variety of ways and economists have turned to other ways 
of resolving the problem. Many economists and policy-makers called 
for the power over monetary policy to be either withdrawn totally from 
government (for example by a constitutional clause or by fixing the ex¬ 
change rate to an external monetary anchor) or delegated to an indepen¬ 
dent central bank with a strictly limited mandate for stabilization and 
an overriding obligation to control inflation. This of course means that 
the power of monetary policy to stabilize is largely jettisoned. Some 
institutionalists have tried to deal with this problem by arguing that 
politics will generate institutions, most naturally an independent central 
bank, to restrain politicians’ reflation instincts (as encouraged by popu¬ 
lar opinion) while still allowing the flexibility to deal with shocks. The 
Bank’s charter can be tuned to deliver the first-best combination of low 
inflation and output stability. However this is only a solution of the time- 
inconsistency problem in a superficial sense because it can only work if 
public opinion tolerates it and does not clamour for an override when re¬ 
flation is attractive. We argued that the true solution of the problem lies 
in public opinion itself: understanding how it must reward and penalise 
political behaviour over monetary policy. Given such understanding, a 
convenient but in no way essential way to organise monetary policy is 
then via an independent central bank accountable to public opinion for 
generating low inflation and output stability. 
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Practical issues of effective 
stabilization 


In the last chapter we asked whether there was a case for monetary 
policy to stabilize the economy: we concluded that there was. In this 
chapter we ask what in practice must be done to carry out this stabi¬ 
lization function successfully. We begin with the issue that has haunted 
policy makers since rational expectations became widely accepted: Lu¬ 
cas’ critique of policy choice (Lucas, 1976) which makes the point that 
the models being used for the formulation of policy rules may well not 
remain constant as these rules are reset because the behaviour modelled 
depends on expectations which themselves depend on the rules. Dealing 
with this critique turns out to be an elusive business. Many economists 
have however persuaded themselves that models with overlapping wage 
(and sometimes price) contracts have sufficient constancy under chang¬ 
ing rules to be reasonably reliable as a guide to policy formulation. A 
large literature has grown up using such models to test different sorts 
of rules. We look at these tests and other ways of evaluating policy in 
practice. 


THE LUCAS CRITIQUE OF POLICY EVALUATION 

It was Lucas (1976) who first pointed out that if expectations are formed 
rationally, then unless the estimated equations used by model builders 
to evaluate the consequences of alternative government policies are gen¬ 
uinely structural or behavioural, the implications of such simulations or 
evaluations may be seriously flawed. The essential insight of Lucas is 
that, when expectations are formed rationally, agents react to the be¬ 
haviour of government. Consequently, unless the equations estimated 
by the model builder are structural, the coefficients in equations which 
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are estimated over one policy regime will implicitly depend on the pa¬ 
rameters of the government policy rule in operation. Consequently, the 
evaluation of alternative policies can be quite misleading. 

Let us take as an example the following simple model, with overlap¬ 
ping contracts (so that stabilization policy is effective), a quantity theory 
demand for money function and a money supply rule that responds to 
past output. All the variables are measured in deviations from their 
normal values. 


Vt = (3{p t - 0.5[E t -iPt + E t _2Pt\) 

(1) 

m t =Pt+yt 

(2) 

nit = pyt- 1 + Et 

(3) 

The solution is 


/ 3 0.5/Z/3 2 

Vt l+/ t+ (1+/?)(! + 0.5/3) ° t ” 1 

(4) 


from which it is immediately apparent that the optimal monetary re¬ 
sponse is nil {(i = 0). 

What we have just done is to set out the model in its ‘structural’ form 
(that is, in terms of its underlying relationships). But suppose we had 
written the model in a ‘reduced’ or solved form, with each variable in 
terms only of its ultimate determinants after all the model’s relationships 
have worked themselves out in the current period. Equation (4) is such 
a form, but since the error term, e t , is not directly observable (it is an 
implication of equation (3), we could write (4) in terms of observable 
ultimate determinants m t and y t ~ i (using e t = m t — y t ~ i from (3)) as: 

y t = a im t + CF- 2 m t -i + <J 3 y t -i + a^t -2 (5) 

where oq = ct 2 = jftf+oWj ’ °" 3 = 04 = ^ a 2 - 

Having recovered this relationship from the data, we could compute 
the optimal monetary policy for time f, given what has already occurred 
in t — 1 (the planning period). It is obvious that we must set m t so as 
to make y t = 0 (we have one instrument, m t , and one target, y t , so the 
problem here is simple) or: 

m t = — —(a 2 m t - 1 + cr 3 y t -i + <7iy t -2) (6) 

<71 

According to our relationship (5), this feedback rule for money supply 
ought to deliver zero fluctuation in output. However, in fact it may de¬ 
liver worse fluctuations even than the feedback rule (3) the central bank 
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was previously following! Actual money supply will be given by (6) plus 
et, which is the unpredictable error in executing monetary intentions. 
The resulting rule can be written as: 


m t = ny t -1 + 


£t 


1 + 


0.5fi/3 r 
1+0.5/3 ^ 


(7) 


(using our knowledge of the <r,; in terms of the model’s structural param¬ 
eters). When this is substituted into the model, we find the new solution 
for y t as 

_ P , 0.5/+/3 2 ^ 

Vt l+p £t (l+/3)(l + 0.5/x/3) 2 "‘ _1 

Here output is far from perfectly stabilized with respect to past events 
— though it will in this instance be slightly more stable. 

Where we went wrong was that we used the reduced form to calculate 
the optimal rule. But that reduced form itself depends on the monetary 
rule because the rule influences (rational) expectations and so economic 
behaviour (this problem would not arise in adaptive expectations mod¬ 
els, where the effect of money on output does not depend on the rule 
itself). So the reduced form will not remain constant (as assumed) as 
we change the rule towards the ‘optimum’. Hence we must compute a 
false optimum. 

The policy maker using these methods of optimal control is doomed 
to go on recomputing his ‘optimal’ rule as his ‘model’ constantly changes: 
it will not necessarily converge to a constant rule and model, however, 
nor is there any guarantee that it will get any better over time. It could 
get worse, or alternatively better and worse. In general, we cannot say. 

The problem we have just described is not too difficult to solve. Once 
the policy maker recognizes that the model underlying (5) is a rational 
expectations model consisting of (1) to (3), he will use the appropriate 
method and set //, = 0! Estimating the structural parameters of the 
model is a technical problem not in principle more difficult than esti¬ 
mating the reduced form parameters (we discuss the problem briefly in 
chapter 16). 


DEEP STRUCTURE 

We have assumed that (1) and (2) are truly ‘structural’, that is, that 
they are invariant to the policy maker’s rule. However, this may well 
not be so. People’s behaviour is itself the result of optimizing subject to 
constraints of budgets and technology; policy rules will typically change 
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their budget constraints because expected future prices and incomes will 
depend in general on the economy’s behaviour which in turn depends on 
the rules. Equations (1) and (2) express people’s resulting behaviour in 
demand and supply functions; these may therefore alter as policy rules 
alter. Rational expectations implies that the vast majority of the eco¬ 
nomic relationships econometricians typically estimate are not strictly 
structural. 

A simple example of this arose with the model of equations (22) and 
(23) in chapter 4, where we assumed signal extraction from local prices. 
We saw that output depended on the stabilization parameter, fi: 


yt-y 


q-0) 

_ 8 _ 

1+ s 6(l+a) 


£t 


(9) 


It might seem from this that for maximum stability // should be 
pushed as high as technically feasible (implying dramatic monetary con¬ 
traction in booms, expansion in recessions). However, this conclusion 
treats 0 as a structural parameter. It is not, since we know that <f> = 

7T 2 <7 2 

° f ;; , the slope of the Phillips curve, depends on the extent to which 


the variance of local prices reflects local shocks and overall monetary 
instability. To compute the appropriate value of we must allow for this 
as discussed in chapter 4. 

Lucas’ critique therefore raises deep difficulties for econometricians 
attempting to recover from the data relationships which are usable for 
policy making. 

One reaction to this problem has been to assert that only the param¬ 
eters of preferences and technology (‘deep structure’) will be regime- 
invariant and that macroeconomists should therefore estimate these. 


Some early examples are Hansen and Sargent (1980) and Sargent (1978, 
1981). A research methodology along these lines is now in full swing. 
This work models agents at the microeconomic level as intertemporal 
optimizers subject to the constraints of budget and technology, and at¬ 
tempts to retrieve the parameters of the (aggregated) utility and pro¬ 
duction functions. In chapters 11-13 we set out some key features of 
these ‘representative agent’ models. The problem is to find a way of em¬ 
bedding sufficient detail into the constraints facing agents to allow these 
models to confront the data successfully. 

The models use estimates of utility and production function parame¬ 
ters derived from microeconomic studies (usually of large cross-sectional 
data sets). Processes for shocks (error terms) to technology and pref¬ 
erences are then estimated from the time series when restricted by the 
results of these studies. The success of a model in predicting the facts 
of the business cycle is evaluated by comparing the second and higher 
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moments of the predicted and actual series: techniques of statistical in¬ 
ference from this comparison are not yet widely in use, though available, 
whether because of the high costs of using them or because the model 
practitioners do not feel that such stringent testing is appropriate, given 
the models’ relative infancy. Whether such models yet have sufficient 
dynamic structure to be useful for evaluating stabilization policy is still 
a matter of debate. However, they do have the clear strength that, being 
in principle invulnerable to the Lucas critique, they should give a useful 
guide to the longer-run effects of policies once people have understood 
the new environment. 

Models of this general equilibrium type are widely used in evaluating 
such policy issues as the effects of world tariff reduction rounds or the 
introduction of funded private pensions. But for this purpose the interest 
is in the long-run steady state results and often we have reasonable 
confidence in at least the rough magnitude of the long-run parameters 
to be used. The problem with using these general equilibrium models 
for evaluating stabilization policy is that the parameters of importance 
are the dynamic ones that govern lags of response — that is, those 
of expectations and adjustment principally; these have generally been 
retrieved from time-series estimation as in cross-section there is usually 
not enough time variation to estimate response to change. However, in 
recent years ‘panel’ data-sets (where there is variation across agents and 
also over time) have offered the promise of resolving this problem. 

A second school of thought, of which a major proponent is Sims 
(1980), asserts that there is no practical possibility of policy evaluation 
and the best we can achieve is the estimation of time-series models whose 
parameters will shift in an unpredictable way with regime change. He has 
proposed the estimation of simultaneous time-series models (i.e. where 
each variable depends on its past and also on the past of the other 
variables in the model). These ‘Vector Autoregression 1 (VAR) models 
have proved extremely popular as ways of summarising an economy’s 
behaviour. The difficulty with them is of interpretation: plainly they 
are not structural and so inferring what the structural relations may 
be that underlie these solved-out time-series relationships requires the 
assumption of ‘identifying restrictions’, in other words the modeller must 
assume some structural model. The method however means that the 
links between the structural model and the VAR parameters are complex; 
in practice the structural parameters cannot be retrieved solely from the 
VAR estimation. Again, this strand of thought has prompted a huge 
research programme to refine the method. 

A final reaction has been to model expectations explicitly, but to con¬ 
tinue to treat as structural the parameters of such macroeconomic model 
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equations as the consumption and investment functions; this approach 
has been adopted for example in the Liverpool model (chapter 16) and 
other examples are Blanchard and Wyplosz (1981), Taylor (1979a) and 
Holly and Zarrop (1983). It is recognized by these authors that the 
parameters of these equations will change as regimes change, but it is 
argued that the major impact of regime change will be felt in the expecta¬ 
tions variables, while that on the parameters themselves, except for quite 
violent regime change, may be of second order importance. Nevertheless, 
these models have increasingly attempted to introduce ‘micro-founded’ 
equations, thereby getting closer to deep structure. This approach has 
tended to dominate hitherto, practical policy evaluation mainly being 
done using models of this type, with on the face of it reasonable success 
in the past decade in controlling both inflation and output fluctuations; 
we discuss the results in the next section. 

We end this section, therefore, in a cautious vein: policy evaluation 
is certainly difficult, but various researches are in hand which may offer 
scope for better evaluation in the future. 


EVALUATING POLICY IN NEW KEYNESIAN MOD¬ 
ELS WITH OVERLAPPING WAGE/PRICE CONTR¬ 
ACTS 

In practice many modellers in the 1990s work with overlapping contract 
models of the New Keynesian type which we discussed in chapter 4. As 
we have seen these models exhibit a high degree of nominal rigidity which 
enables monetary policy to be effective in stabilizing the economy. 

A simple framework for looking at this work is the New Keynesian 
model of chapter 4, discussed earlier in this chapter (equation 55), to¬ 
gether with the assumption that the government or central bank controls 
aggregate demand through monetary policy. In that set-up, let us ignore 
the time-inconsistency issue, as we argued was appropriate at the end of 
chapter 5; we will assume that there is some mechanism, such as that of 
popular disapproval, restraining the government from trying to exploit 
the Phillips Curve by systematically stimulating the economy beyond its 
natural rate. So the government’s actions are restricted to varying pol¬ 
icy in response to shocks to the economy, as summarised in the Phillips 
curve shock (remembering this can reflect the whole menu of supply and 
demand shocks through intertemporal substitution or exchange rate ef¬ 
fects on the supply of labour and output). Since this set-up is known, 
people will also know the rule, and hence will form their inflation ex¬ 
pectations knowing it will hold; therefore inflation expectations over the 
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long term are equal to the inflation target, while in the short term they 
depend on the speed with which inflation converges on the target, as 
determined by the parameters of the model including policy. Our New 
Keynesian supply curve is: 

Vt = cr(p t - 0.5 [E t _ip t + E t _ 2 p t }) + 73/t-i + u t (10) 

where y t is the log deviation from the natural rate (i.e. y* = 0). 

In practice of course models used for policy evaluation are more com¬ 
plicated than this. But this simple model is sufficient to illustrate the 
nature of the choices available. What emerges is a trade-off between 
the variability of inflation and the variability of output; not of course 
between the average levels of output and inflation between which there 
is no (long-run) trade-off — the average level of output is its natural 
rate and the average level of inflation is its target value, varying which 
will of course make no difference to output. But inflation changes can 
accommodate shocks to some degree and so dampen down their effect 
on output; this reduces output variation at the cost of raising inflation 
variation. 

We saw in chapter 5 that if there is a social welfare function which in¬ 
cludes the squared variations in inflation and output around their target 
and natural rate respectively then expected social welfare will depend 
on the variances of inflation and output. The optimal feedback response 
of inflation to the shock (which we assume is held to for as long as the 
model prevails) will then as we saw depend on the degree of dislike of 
output variation and on the slope of the Phillips Curve. The flatter the 
slope (the higher c) the less the response required; the greater the dislike 
of output variation (a) the higher the response. 

One can interpret this social welfare function — which is certainly 
widely used by central banks in practice — as reflecting on the one hand 
the individual agent’s dislike of inflation uncertainty (in creating con¬ 
sumption uncertainty which needs then to be insured at some cost) and 
on the other hand the social cost of unemployment variation, as dis¬ 
cussed above in chapter 5. Various efforts have been made to produce 
such micro-based justifications (e.g. Rotemberg and Woodford (1998), 
Woodford (1998), and Minford, Nowell and Webb (1999)). If one uses 
this interpretation it follows that reducing the extent of the unemploy¬ 
ment benefit subsidy to unemployment would reduce the extent of the 
distortion from unemployment variation and so reduce the size of a and 
the need for activist monetary accommodation. However, few societies 
appear to have reached this situation; and the consensus among policy¬ 
makers is that monetary policy ought to stabilize output fairly actively. 
Witness for example the behaviour in 1997-8 of the Federal Reserve 
Board in response to the Asian Crisis or that of the Bank of England’s 
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Monetary Policy Committee to the threat of recession in autumn 1998; 
in both cases quite sharp cuts in interest rates were made in order to 
avert recession. 

In recent policy therefore, as dominated by this New Keynesian 
framework, active feedback monetary response has been prevalent. To 
implement this response central banks have favoured using interest rate 
rather than money supply rules. Demand for broad money functions 
have proved unstable in the world of deregulated banking that has gen¬ 
erally prevailed in the 1980s and 1990s; while narrow money demand 
functions, though immune to deregulation, have proved vulnerable to 
technological change (credit cards and cash machines), to movement 
in the black economy, and to increasing use of home notes in foreign 
economies (e.g. ‘dollarization’ in Israel and Russia). As the interpreta¬ 
tion of money supply movements became more uncertain central banks 
have relied increasingly on direct evaluation of economic conditions and 
a response via interest rates (this implies some equivalent money sup¬ 
ply response — but evaluating it is of course uncertain when demand 
functions are unstable). 

In chapter 5 (and the appendix to chapter 11, on dynamic program¬ 
ming) we derive the optimal response of inflation to the Phillips curve 
shock on the assumption that this can be exactly delivered by inter¬ 
est rate movements; the optimal response to other shocks is nil on this 
assumption. In practice however the shocks cannot be observed and in¬ 
terest rate policy cannot deliver an exact inflation outcome; and so a 
choice must be made between rules (of reasonable simplicity) that re¬ 
spond to observable variables, usually inflation and output themselves. 
This choice can be made by embedding these rules in a model of the econ¬ 
omy and checking the resulting variances of output and inflation (either 
analytically or by stochastic simulation).To see what sort of interest rate 
response would be optimal on this basis, assume that aggregate demand 
depends on real interest rates (an IS curve expressed in nominal terms; 
using interest rates bypasses the LM curve): 

Pt + Ut = ~Pr t (11) 

where p t is in log deviations from the (moving) price level target and r t 
is in deviations from the natural real rate of interest. 

We use prices in place of inflation and a moving price level target 
in place of an inflation target as this simplifies our exposition. (To 
re-express policy in terms of inflation targeting would introduce more 
complicated dynamics.) 

We now write the interest rate rule as: 


n = UpPt -1 + PyVt -1 


(12) 
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(To turn this rule into one for nominal interest rates one simply adds in 
the expected inflation rate.) We can set the solution up using the Muth 

OO 

method, with Pt = Qi u t-i , by substituting from (10) and (12) into 

i =o 

( 11 ) for y t and r t , to obtain 
(1 + (3p y L)([l + aq 0 \u t + 0 . 5 aqiu t -i) + 

OO 

(1-7^)(1 +/3[i p L)J2qiUt-i = 0 (13) 
Collecting terms then gives us: 


i=0 


K) 


go = 


l 


( Ut- 


<7i 


1 + a 

PiVp - Hy - l) 


(1 + a)(l + 0.5a) 

(ut- 2) q-2 = -qi( 0 . 5 Pn y cr + ( 3 p p - 7) + (- yflfJ, p )qo 

(u t -i, i ^ 3) qi = -((3p p - 7)®-i + (7/5 m p )®-2 
Now turn to the variance of output. Because 
(1 + aq 0 )u t + 0 . 5 <?iUt_i 


(14) 

(15) 

(16) 
(17) 


yt = 


1 — 7 L 


= (1 + cr qo)u t + V(1 + ag 0 + —^) 7 *u t -» ( 18 ) 

Z-/ /-V 


i =1 


the variance of output is 


vary t = {(1 + ag 0 ) 2 + (1 + ag 0 + 
By contrast the variance of prices is: 


0.5gi 2 7 ' 


7 


) —-- } 

j 1 — 7 2 ^ 


uarp t = ^ qfa 2 u 
i—0 


(19) 


( 20 ) 


Policy cannot affect g; but it does affect all other g^. The optimal 
set of these for minimising varp t are g^ = 0 (i ^ 1). The optimal set for 
minimising vary t is (1 + ag 0 + ^-) = 0 , that is gi = 0 , implying 
that 




v 'P 


- 7 + 


7(1 + 0.5a) 
05/3 


( 21 ) 
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whereas for qi = 0 we must have 

M y = M P - 7 (22) 

Hence there is a trade-off between vary t and varp t , involving the 
choice of q\. This is a general feature of these models (Taylor, 1999; 
Clarida, Gali and Gertler, 1999) in respect of the Phillips Curve shock, 
u t - Plainly, in such models r t should aim to offset any ‘pure’ demand 
shocks (i.e. any shock to equation 11) entirely, since they must add to the 
variances of either prices or output or both; however, in practice matters 
are not so simple since as we have seen demand shocks will also enter u t 
(via supply or cost effects induced by interest rates or exchange rates). So 
the trade-off between the two variances is pervasive across most relevant 
shocks and should therefore show up in stochastic simulations of models 
for a complete set of shocks. One can see that a society that cared a lot 
about output fluctuation would set a y, y close to (21) and then find a n p 
that minimised qf (i ^ 2) subject to this. Clearly the closeness of fi y to 
(21) would depend on how much society cared about output variability. 

Rather than find the optimal /is for this particular, illustrative and 
highly simplified model, we report on findings from full models about 
the results of stochastic simulations with interest rate rules of the form 
of (12). Rules of this type have been called Taylor rules after John 
Taylor who first propagated them having completed a spell on the US 
Council of Economic Advisers, where he felt the need to come up with a 
‘monetarist’ rule that did not rely on targeting the money supply given 
its instability — the original suggestion for such rules was by Henderson 
and McKibbin (1993). Econometric work has found that rules of this 
type fit central bank behaviour quite well; this does not necessarily mean 
that they actually adopted these rules but rather that whatever rule 
they did adopt produces interest rate behaviour of this sort (Minford et 
al. (2001), show that a wide variety of monetary rules can produce an 
appearance of a Taylor rule). Given that economic behaviour varies with 
the policy regime, we should be rather cautious in translating results for 
Taylor Rules into equivalent results say for money supply rules. But we 
will assume in what follows that having a coefficient on output in a Taylor 
rule is qualitatively at least similar to having a feedback coefficient on 
output in a money supply rule. 

In a recent article Taylor (1999) compared various parameteriza- 
tions of such rules within a variety of econometric models of different 
economies (containing many complexities omitted here but obeying the 
basic logic of our simple model) and concluded that a quite simple rule, 
it = 1.57r t -t-0.5y t -l-io, did best out of those investigated. His results show 
that if the lagged interest rate is included with a coefficient of unity (so 
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Standard deviations* of 

Inflation 

Output 

Interest Rate 

Output Coefficient = 0.5 

2.13 

1.94 

2.82 

Output Coefficient = 1.0 

2.16 

1.63 

3.03 


^average across nine models; % p.a. for inflation and interest rate; % for output. 


Table 6.1: Average behaviour of rules of nine models 


that it is the change of the interest rate that reacts to output and infla¬ 
tion) then in models without rational expectations the results are often 
unstable; this is because the interest rate keeps on rising in response to 
a stubborn inflation shock and since expectations of the future effect on 
inflation fall only slowly, interest rates overreact. It is possible to reduce 
the variance of output by increasing the coefficient on output to 1.0 say. 
But the cost is a slightly higher variance of inflation and a much higher 
variance of interest rates. The average behaviour with these two rules 
across all the nine models examined is shown in Table 6.1 

There is therefore some evidence of a trade-off in estimated models 
between output and inflation variability; also between output and inter¬ 
est rate (instrument) variability, the latter naturally rising as rules are 
more ‘activist’ with regard to stabilising the business cycle (i.e. with a 
higher output coefficient). Different central banks adopt varying posi¬ 
tions along these trade-offs, with the more ‘conservative’ ones adopting 
lower output coefficients; during 2000 Mr. Wim Duisenberg, the chair¬ 
man of the new European Central Bank (ECB), for example, stated that 
the ECB would not be ‘activist’. However, it is too early to say whether 
its policies will adhere closely to this position in practice. 

During the 1990s central bank practice, following this sort of behav¬ 
iour — whatever its degree of activism —, appears also to have been 
successful, both in curbing inflation and in preventing large swings in eco¬ 
nomic activity, at least in the Anglo-Saxon world and in Europe. Japan 
has had an unhappy decade of persistent recession after the ‘bubble¬ 
bursting’ monetary squeeze at its start and the Asian Crisis of 1997 
caused a massive recession in Asia; but these developments were limited 
in their impact on the rest of the OECD and much of the credit for that 
seems to be due to effective monetary policy. 


CALVO CONTRACTS 

A popular device for modelling long-term nominal contracts is due to 
Calvo (1983). Price-setters (or wage-setters, analysed analogously) oper¬ 
ate under imperfect competition where if prices were flexible they would 
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be continuously set as a mark-up on marginal cost. They are assumed 
to face a menu cost of changing their price: this takes the form of a 
lump sum cost which acts as a threshold. If some unexpected shock to 
costs exceeds this threshold, they will change their price and set it to 
the newly expected marginal cost. It is assumed that there is a constant 
probability, 1 — £, of such a shock for each (identical) price-setter. The 
expected losses at t = 0 of the hth price-setter can be written: 

OO 

^Eo^-il + m)^} 2 (23) 

t —0 

where m is the mark-up and c is the marginal cost. 

The first-order condition with respect to the decision to set his price 
at pg then implies: 

OO 

P h o = (1 + m){ 1 - ft) £(/%)%<£ (24) 

t=o 

In other words the reset price is equal to a weighted average of all 
future expected marginal costs plus the mark-up. This expression is 
justified as follows. Consider the losses associated with the decision to 
set Pq. For t = 0, p§ can be freely set and so the loss is [pg — (1 + rn)cg] 2 . 
At t = 1 there is a £ chance of being unable to change his prices from 
Pq and a l-( chance of being able to reset it, in which case any loss 
is not to do with today’s decision; hence the expected loss at t = 1, due 
to today’s decision, is /3£[pg — (1 + m)EoCi] 2 . Similarly at t = 2, there 
is a £ 2 chance of being unable to change it from p(f in either t = 1 or 
t = 2; there is a (1 — £) 2 of being able to change it in both periods, a 
£(1 — £) chance of changing it in /; = 1 but not in t = 2, and similarly 
of not changing it in t = 1 but doing so in t = 2. In all these last 
theree cases nothing decided for t = 0 affects the losses which are due 
to decisions taken in later periods. So the expected loss at t = 2 due 
to the decision at t = 0 is /3 2 £ 2 [pg — (1 + m^gdl] 2 . Analogously at 
t = i it is P % C[Po ~ (1 +m)EoCi] 2 . Equation (23) will only contain these 
terms in p([ in other words. So differentiating it with respect to pg yields 
the first-order condition above. In general at time t the /ith’s agent’s 
decision is therefore: 

OO 

p* = (1 +m)(l - ft) (25) 

i =0 

Across the population 1 — £ will on average reset their prices in ex¬ 
actly the same way, and £ will retain last period’s price, hence py = p t , 
E t c^ +i = E t c t +i and 


p t - p t _i = (1 - 0(Pt - Pt- 1) 


(26) 
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Now let (1 — m)E t Ct+i = 6E t (y t +i — y% +i ), that is marginal costs are 
a rising function of output (and approximately linear in the region of y%, 
the natural rate of output). Then we can rewrite (26) as: 


7 U =Pt ~Pt-l = 


(i-oa -ernyt 

1 - 


vi) 


- (i -Opt-i 


(27) 


where D is the forward operator, leading the variable but not the date 
of expectation. Hence, multiplying through by the expression in the 
forward operator and collecting terms, we obtain 


K , g(i-Q(i-/%) ,. _ 

nt l - 0£(i - £0 Et * t+1 + 1 - 001 - PC yt Vt 

( 1 - 0 ( 1-00 
i - 001 - PC) 


Pt-l 


(28) 


This is the Calvo forward-looking Phillips Curve in which effectively 
the whole path of future output (marginal costs) affects current price 
rises. 

In what we did above we implicitly assumed that price-setters were 
resetting their prices when expected general inflation was zero; the idea is 
that the shocks they face are relative, micro, shocks. If they expect rises 
in the general price level, then it is usual to argue that, being general, 
all would be expected to raise their own prices in line. It would be like a 
general relabelling or indexing of prices that would not incur the menu 
cost, which arises from getting out of line with others. Thus all authors 
(e.g. Erceg, Henderson and Levin, 2000; Christiano, Eichenbaum and 
Evans, 2002) add on an ‘updating term’ to allow for general expected 
(or ‘core’) inflation. Thus n t is to be read as 7r t — W t where 7f t is this 
expected general inflation rate. Strictly therefore we should write: 


7T t = 7T t - 




6 ( 1 - 0(1 -00 
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(Pt -1 - Pt-l) 


■vi) 

(29) 


where the last term is usually neglected of small order. can 

be greater or less than one (so that the forward sums converge provided 
that (y t — Vt) converges at a rate 7 such that 'J -< 1 ). 

7 ft can be thought of as the rate of general indexation; for that rea¬ 
son some authors (e.g. Christiano et al., 2002) write it as 7r t __i to allow 
for a lag in indexation; this introduces lagged inflation as well as ex¬ 
pected future inflation into the Phillips Curve. It must be confessed 
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however that there is some confusion on this point; clearly, for example, 
if 7ft = E t n t = 7 r t and E t W t +i = E t TT t +i, the relationship dissolves. If we 
introduced an information lag, so that 7f t = E t -iTr t , then it becomes the 
standard surprise inflation Phillips Curve of Lucas, Sargent and Wallace 
discussed passim in earlier chapters. 

Equation (28) is typically used in New Keynesian models with an IS 
curve and an interest-rate-setting rule, so that the causation runs from 
the interest rate to output and so to inflation; notice that the rule must 
drive output back to y* faster than to avoid inflation divergence. 

Rotemberg and Woodford (1999) have argued in favour of ‘super-inertial’ 
interest rate rules of the form: 

n = ^’t-i + a(y t ~y* t ) + b{n t - tt*) (30) 

where A -< 1. The reason is that only very small reactions of interest rates 
are required to stabilise output and inflation because these reactions are 
not only very long-lasting but also diverge so that output and inflation 
are driven rapidly back to their targets by the prospect of such interest 
rate paths. However, as we saw above, with backward-looking influences 
on behaviour, such super-inertia creates possible instability. 

Calvo-style Phillips Curves have a form that is analytically tractable 
but their interpretation in the context of ongoing inflation is problem¬ 
atic. When underlying inflation is zero, one can under certain assump¬ 
tions write the representative agent’s welfare as a linear combination of 
the variance of inflation (which creates relative price distortions disturb¬ 
ing consumption) and the variance of the output gap (which disturbs 
employment and so leisure) — e.g. Gali and Monacelli (2002). This 
then justifies the standard approach to optimal monetary policy which 
involves trading off the variance of inflation and the output gap, in the 
manner used in earlier sections. 


DOES THE LUCAS CRITIQUE AFFECT NEW 
KEYNESIAN (TAYLOR) RULES? 

However, even if this monetary policy devised using New Keynesian 
methods has apparently been successful, we must ask whether this suc¬ 
cess is stable or could be undermined by some parameter shift of the 
sort implied by the Lucas Critique. For the models assume that the 
parameters (of Phillips and IS curves) remain constant even though the 
monetary regime is being altered towards an optimum. 

We saw in the first section of this chapter that the Phillips curve 
slope in particular depended on the variability of inflation. The more 
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activist the monetary policy, the higher this variability; and hence the 
the steeper the Phillips Curve (the lower a). This in turn implied that 
activism should be more moderate than implied with a fixed slope of 
the Phillips Curve. (In other words excess activism would actually, by 
steepening the Phillips Curve, make the combined variability of inflation 
and output worse.) The calculations reported by John Taylor do not take 
account of this extra loop of effect on the economy’s behaviour; and we 
may reasonably expect, given this argument, that the rules understate 
the response to inflation (i.e. interest rates should keep inflation closer 
to target instead and not be so influenced by output fluctuation). 

This line of thinking might also lead us to consider other aspects of 
monetary policy that could affect the slope of the Phillips Curve. In 
a recent paper Minford, Nowell and Webb (1999) pointed out that the 
persistence of shocks to the price level within the period of overlapping 
contracts has the effect of raising the variance of real wages within the 
contract period — because the more persistent the shock the more are 
contracted nominal wages disturbed from their intended average level 
over the contract period. This persistence could be expected to produce 
a high level of indexation which would greatly steepen the Phillips Curve; 
there is strong evidence of widespread indexation or equivalent practices 
since the 1970s with no real sign of any reduction during the low-inflation 
1990s. In terms of quarterly monetary policy it implies that central banks 
should pay a great deal of attention to keeping prices on average in the 
contract period close to the trajectory expected — effectively targeting 
the price level not the inflation rate. A rule that targets inflation (or 
indeed the growth rate of the money supply) implies that the price level 
will be non-stationary (i.e. has an infinite variance in the long run). 
Such total long-run uncertainty must undoubtedly carry economic costs. 

What these, and no doubt other related issues of parameter variation, 
indicate is that the optimization of monetary policy is a still more com¬ 
plicated matter than these New Keynesian methods imply. The classical 
economists set great store by the certainty of the price level in both the 
short and long run, in order to underpin the efficiency of the market price 
signalling process; they put no emphasis on activist monetary policy, be¬ 
lieving that a constancy of prices (and they thought by implication the 
money supply) would generate sufficiently stabilizing output behaviour 
by the private sector. Since Keynes monetary policy has increasingly 
been pressed into service to stabilize output. What we have yet to work 
out is how great a conflict there is between these two aspirations for 
monetary policy. 
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CONCLUSIONS 

In this chapter we accepted that monetary policy could effectively sta¬ 
bilize output fluctuations and also that there was at least in principle a 
case for so using it, at least to some degree; we thus turned to a consid¬ 
eration of what that degree might be in practice and how to calculate 
the appropriate rule of behaviour, whether for interest rates or for the 
money supply. We began by setting out the key practical problem of such 
calculations: Lucas’ critique of macro models whose parameters would 
shift in response to changing monetary rules. Clearly to calculate the 
appropriate response we must know the parameters of the model that 
will occur under every relevant monetary rule; this means in effect that 
we must know the parameters of the underlying structural model of the 
economy (i.e. the one with the parameters that will not shift in response 
to a changing environment). From that model — even if we do not use it 
directly — we can work out what will happen to the parameters of any 
other model that we do use directly. Just relying on estimated reduced 
form models would be insufficient because we would know they must 
change with changing monetary rules as the expectations implicit in 
them changed. Unfortunately the problem of parameter shift runs deep: 
IS/LM/Phillips Curve models with rational expectations solved explic¬ 
itly are still vulnerable because their supply and demand parameters are 
those of optimising behaviour and with different rules the optimal be¬ 
haviour changes. We discussed three main approaches to the problem. 
First, to build new models whose parameters were those of tastes and 
technology (presumably immune to policy change); this is empirically 
difficult because these parameters can only be observed indirectly. Sec¬ 
ond, to abandon any pretence of structural models by estimating time- 
series models whose parameters are easily recoverable; and to design 
policy rules that are robust to any possible variations in these parame¬ 
ters. The problem here is that we have little way of knowing how big 
these parameter variations might be in the absence of a structural model 
to predict them. Third, to assume that IS/LM/Phillips Curve models’ 
parameters (with overlapping New Keynesian contracts) are in practice 
reasonably immune to relevant variations in monetary rules. Much prac¬ 
tical work has been done with such models to find optimal interest rate 
(Taylor) rules; and this work has been very influential on central bank 
practice with apparently successful results in the 1990s. The question 
with which we concluded was whether these rules were in spite of these 
hopes vulnerable to Lucas’ critique (given the extent of indexation and 
equivalent practices that could steepen the Phillips Curve); at this stage 
we offer a guarded ‘yes’ to this question (the evidence also suggests that 
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indexation or equivalent practices are extremely widespread) and suggest 
that future work may give a stronger role to price level stability. 



7 


Fiscal Policy and the government 
budget constraint 


Up to now we have discussed fiscal policy as an element of demand 
management, separate from monetary policy and useable both as an au¬ 
tomatic stabiliser and for feedback policy. However, the main focus of 
our discussion of stabilization policy has been on monetary policy, which 
in practice has been almost exclusively the instrument governments have 
used for this purpose. In this chapter we consider the role of and con- 
traints on fiscal policy in more detail. We start with accounting, the 
implications of the government budget constraint; then go on to various 
models of how fiscal policy affects the economy; finally we turn to policy 
implications. 


GOVERNMENT SOLVENCY AND THE 
GOVERNMENT BUDGET CONSTRAINT 

The first question we must ask about government tax, any resources it 
raises by printing money with consequent inflation (its ‘seigniorage’), its 
spending and its borrowing is: what are the limits on a government’s 
actions? These limits are implied by its budget constraint, which relates 
these elements. It can conveniently be thought of as the borrowing re¬ 
quired to finance the budget deficit, the gap between spending, inclusive 
of debt interest, and tax plus seigniorage. Sometimes it is useful to refer 
to the ‘primary’ deficit which is spending excluding debt interest, less 
tax. To keep the exposition as simple as possible we will assume that 
all debt is one-period indexed bonds, the number of which we denote by 
b t : such a bond is a promise to pay (next period, t + 1) £1 plus the rate 
of inflation and a real rate of interest, ry ; hence its face value today 
is always £1. Later we will discuss complexities introduced by nominal 
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bonds. 

Let us define the budget constraint then as: 

bt+i ~ W = gt+ 1 ~ T t+i + r tbt. --75-= dt+ 1 + r t b t (1) 

-nt+i 

where d t is the primary deficit (in real terms); g t , r t are respectively real 
government spending excluding debt interest and real taxation; and H t 
is the monetary base (government’s own monetary liabilities) and so the 
real value of its increase is the seigniorage. This equation plainly tells 
us how debt will accumulate as the government runs primary deficits . 
We can project it into the future and track out the implications of such 
policies for debt. 

However, matters do not end there since plainly debt must eventually 
be paid off. ‘Eventually’ here could in practice mean at a future date 
which is constantly postponed — in other words, the government does 
not actually pay it off by any particular date because it continually 
persuades its creditors to take more debt (‘rolling it over’). Thus we 
can let the date of repayment be infinitely far in the future; nevertheless 
we assume that that day is some time in the future. We can put this 
another way: if a government is seen to be in a position where it will 
never pay its debts and so will have to default, then it will be violating 
the terms of its debt and so of its budget constraint: it will be insolvent. 
We will assume that the government does not behave in this way. (We 
will deal later with what happens if it does and a default is expected.) 

On this assumption the government’s outstanding debt defines the 
limits on its future deficits. It can only issue debt that is ‘backed’ by fu¬ 
ture primary surpluses exactly sufficient to pay it off with its interest. If 
its debt is so backed, then it is said to be solvent. Solvency is guaranteed 
if the government budget constraint is met at all times. 

This can be seen by repeated forward substitution for b t +i in (1); we 
will assume a constant real interest rate, r. So 


b t = 


St+i + bt +1 
1 + r 


( 2 ) 


where s t is the primary surplus. Continuing to substitute for b t +i we 
obtain: 


N 

*=x: 


i—1 


$t+i 

(1 + 7-)* 


bt+N 

(l+r) w 


( 3 ) 


If we now let N go to infinity and impose the condition that the 
debt will eventually be paid off (which we interpret as above: that the 
government may roll over its debts but is expected not to default, that 
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is to be in a position to pay them off some time), then the last term goes 
to zero and we obtain: 




(1 + r)* 


( 4 ) 


which is the solvency condition. 

In fact, we note from (3) that the condition for the remainder term 
to go to zero is weaker than that debt should be totally paid off at some 
(indefinite) date: it is merely that real debt should grow less rapidly 
than the real rate of interest. This shows a peculiarity of infinite time: 
that the government not only need never pay off its debts but may even 
raise its debt, provided the growth rate does not exceed the rate of inter¬ 
est. Buffer (1999) has called this the great ‘puzzle’ of fiscal arithmetic. 
Plainly if the economy has a finite life then when it ends debts must be 
paid off: everyone will wish to use up their wealth in consumption (or 
transfer it to another economy) before the end. The same applies if the 
economy has a finite life whose end is constantly postponed. 

In fact there is some difficulty in assuming the economy comes to 
an end; for behaviour being forward-looking, those of our theories that 
have people acting in the interests partly of their descendants would fail 
were the economy to stop — the meaning attached by people to life’s 
purposes would be drained away, as would the incentive to maintain 
any of society’s institutions or infrastructure. Our assumption is that 
people act as if they only care about themselves, in which case they 
would simply consume all they have before the economy’s and their end; 
implying that the government must also pay off its debt, since people will 
call it in and consume it. However, law and order (which rely on social 
institutions being respected) might also break down. The end of the 
economy is effectively the end of its inhabitants’ world; imagining how 
people would behave if the world were to end is hard, if not meaningless 
(an amusing attempt is in Douglas Adams, 1978, ‘The restaurant at the 
end of the universe’ where the whole point is that people cannot really 
imagine the end). 

This seems to force us to consider time which is truly infinite - 
that is, where the economy never stops. In this situation how can we 
interpret the constraint that the government must simply have its debt 
grow at less than the rate of interest? Clearly it may be asked to pay 
it off (unlike money, where it knows it will never be asked to redeem its 
fiat currency). Anyone redeeming state debt will force the government 
to sell the debt to someone else: solvency then requires that when resold 
it has a present value equal to its face value (at least equal — implicitly 
we assume that if its present value was greater, then taxes would be 
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cut or spending raised, in effect redistributing the excess back to the 
citizens.) On reflection, this is no more troublesome as a way of avluing 
government paper than the way we value say private equity (the present 
value of future dividends). 

Government debt in other words is sold at a price that reflects its 
‘value’ that is, its discounted stream of primary surpluses, its cash flow 
available for paying interest and repayment on the debt. The transver- 
sality condition on bt+N forces exactly this condition on that future debt. 

It may be useful to illustrate this point with an example. Consider 
three cases. 

First, let debt be increasing faster than the rate of interest. By 
definition: 


A b t = rb t - 1 - s t (5) 

Because > r, St. must be negative. This is the case of Ponzi 
finance, where debt is run up to pay off interest and pay for extra net 
spending. 

Suppose this situation prevails from time t + N + 1, where the re¬ 
mainder term of (7.3) cuts in, and let s be constant and negative. Then 
the present value of b t +N at time t + N can be written by discounting 
future surpluses as: 

11s 

p.v.of b t+N = a[— + - + (1 + r)jV+1 + -] - ~ (6) 

Plainly, s being negative the debt will be valueless. 

More generally, let debt be growing at the constant rate g(> r), so 
that = r — = g and thus = —(g — r) is constant, implying 

that primary surpluses are both negative and also growing at the same 
rate as debt, g. 

Then the present value of b t+ n becomes: 


p.v.of b t+N = 


St+N+l bt+N 
bt+N 1 + r 


St.+N+2 bt+N+1 
b t+N +i(l + r) 2 


= bt+N[-{g-r)][j^ ; + 

1 +ff . (1+ff) 2 . 1 

(1 + r) 2 (1+?’) 3 


jg-r) 
1 +9 


oo 

h+N ^2 
i= 1 


(i + g)\ 

(1 -I- r)* 


= —oo 


( 7 ) 


The present value of the government’s future cash flow in (6) is neg¬ 
ative and in (7) is infinitely negative; in both cases its debt at t + N, 
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b t+ jy, is valueless. In other words it will be impossible to issue debt at 
t + N + 1, bt+N+u to pay off bt+N\ it follows that b t+ N itself could not 
have been issued, given this was known. And so on, back down the chain 
to the present time, t. In period t the government will be unable to issue 
any more debt, 6t+i, and will therefore default on its current debt, b t . 

Secondly, consider the case where g = r. Then (7) equals 0. There 
are neither primary deficits or surpluses from t + N. 

The accumulated debt, b t +N, has no value, as there is no future cash 
flow to service it; hence there is also insufficient cash flow from t onwards 
to pay off the current debt, bt. The government is not therefore solvent; 
it will again be unable to issue new debt and will default on 6 t . 

Thirdly and finally, consider the case where g < r. Then (7) becomes 
finite and equal to b t .+N , in other words its face value. Hence the present 
value at t of future surpluses from t + N + 1 onwards in this case is 
equal to that of the debt which will have been accumulated up to that 
point by previous deficits, b t +N • The government’s outstanding debt is 
therefore ‘backed’: the government could pay off its debt at any time 
(by just surrendering its cash flow to creditors). The essential point is 
that with g < r, there is a primary surplus from t + N+1 which starts 
equal to (r — g)b t +N and grows at the same rate, g, as debt (hence in 
effect remaining equal to this growing value). 

The government’s solvency or transversality condition can therefore 
be seen to be the commonsense condition that at all times the debt must 
be backed by primary surpluses with a present value equal to the debt’s 
face value. 

If this were not to be satisfied then the market would react by mark¬ 
ing the debt’s price below face value. Call this price D t . At all points of 
time this price will move to make the market value of government debt 
equal to its present value. Hence we can write the price equation as: 

OO 

V St+i 

.. (l+r)‘ 

A = (8) 

What this means is that a government can issue debt which is not 
fully backed by its future policies but only at a price that reflects this. 
We have been dealing with these policies as if they are fully known in 
advance. In this case when future policies change from those that are 
solvent to those that offer inadequate present value, there will be an im¬ 
mediate and sudden fall in the price of outstanding debt below its face 
value. In other words, if the government announces policies which mean 
it cannot pay its debts, it is as if it is ‘writing down’ the obligations 
(coupons etc.) on those debts; which of course implies that their present 
value, D t (their price in the market-place) is correspondingly lowered. 
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In practice, when future surpluses are uncertain, we can think of r as 
the pure risk-free real interest rate and D t as the risk discount factor (in 
present value) on expected future surpluses when valued at the risk-free 
rate (their ‘face value’ as we have put it). Thus equivalently we could 
value these expected future surpluses at the risk-free rate plus the risk¬ 
free premium implied by D t , that is, at the market rate of interest on 
government bonds. This would be the bonds’ market value. This is the 
usual situation in the bond market where a government is following an 
official policy of servicing its debt fully (no default). When a govern¬ 
ment defaults, that is announces it will no longer pay its interest and 
repayment obligations on old bonds in full, then D t is marked in the 
market as an explicit discount (e.g. 20 pence in the pound) on the old 
bonds when valued at the rate of interest applicable to any fresh debt 
the government might raise. 


THE ‘FISCAL THEORY OF THE PRICE LEVEL’ 

In the past few years a surprising literature has grown up claiming that 
the (general consumer) price level, Pj, can be determined by fiscal pol¬ 
icy — for example, Sims (1994) and Woodford (1995) and see Buiter 
(1999) for a critique on which this section draws. The claim can be 
seen quite simply by considering equation (4) above, the solvency con¬ 
dition. In that equation assume that instead of issuing indexed bonds 
the government has outstanding nominal bonds, so that b t = -p 1 ; let 
B t be the market value of these bonds in money terms, computed by 
the normal discounting method given current interest rates (thus for ex¬ 
ample if bonds were perpetuities paying £1 each period their present 
value would be the number of these bonds divided by f? t the long-term 
rate of interest). Now suppose the government makes plans for future 
primary surpluses (on the RHS of the equation) whose present value is 
lower than the real value of the debt — this policy these authors call 
‘non-Ricardian’ (if the government plans an RHS equal to debt’s real 
value, a ‘Ricardian’ policy, then none of the following is relevant; the 
price level is then not claimed to be affected by fiscal policy — this use 
of ‘Ricardian’ has nothing to do with our later discussion of ‘Ricardian 
Equivalence’) . The Fiscal Theory states that prices will adjust to reduce 
the real value of the debt to equality with this RHS in order to produce 
solvency at the new real value; in effect this adjustment is a devaluation 
of the debt in response to fiscal policies that do not give it sufficient 
‘backing’. 

One oddity of this theory is that it implies the price level will be 
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overdetermined under the usual assumption that the government sets the 
level of the money supply as well as its rate of growth (seigniorage). To 
avoid this the theory assumes that the government sets the interest rate 
and not the level of the money supply; it does set seigniorage however 
(hence we can think of the interest rate path being set by a rule that 
targets the growth of the money supply or inflation and possibly real 
variables also). In a flexible-price economy this leaves the price level 
indeterminate because the choice of interest rate only sets real money 
balances demanded and cannot fix the split of these between nominal 
money supply and prices. Thus the Fiscal Theory claims that in such a 
situation the fiscal plans will set prices via (4) as above. 

We can immediately note that this argument does not work in a 
model where prices are sticky (as in overlapping wage contracts) and 
there is some Phillips Curve mechanism translating excess demand into 
current inflation; for in this case an interest rate rule will fix the price 
level because it will fix excess demand, so inflation and so given past 
prices the current price level. So in this case, too, as in the flexible-price 
case where money supply is set, there would be overdeterrnination of the 
price level. 

Overdeterrnination of a variable is fatal to a theory because it implies 
an internal inconsistency. Thus we can say at once that for many possible 
economies the Fiscal Theory of the price level will produce an incoherent 
theory overall. Furthermore, we can see from our discussion in the above 
section that the Fiscal Theory leaves out the way in which insolvency is 
dealt with by bond markets, viz. an adjustment of the bond price itself 
(Dt), not of the (goods) price level over which bond markets have no 
control. 

Can we accept the Fiscal Theory as a possible mechanism for deter¬ 
mining the price level where it is left indeterminate by an inadequately- 
specified interest rate rule? Not really, because the bond market, facing 
an equation (4) that was not satisfied, would have two degrees of free¬ 
dom — either adjusting D t or P t . Hence either the price level or the 
bond price would be indeterminate. The model under these policies 
would thus be inadequate to explain a world in which both are seen to 
be determinate. Implictly Fiscal Theorists impose D t = 1 to resolve 
this problem: but this is not merely arbitrary, it is at variance with the 
facts of debt downgrade (that is, a rise in the risk-premium added to the 
risk-free rate). 

The Fiscal Theory therefore does not solve the problem of price level 
indeterminacy due to an inadequately-specified interest rate rule; such a 
world cannot therefore apply since we observe determinate prices. Under 
other assumptions it produces overdetermination of the price level — and 
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thus must also be rejected. Buiter has called this theory a ‘fallacy’ — a 
correct assessment in our view if we interpret it in the bald way we have 
set it out above. 

There is however a constructive way to think of the theory: as defin¬ 
ing a particular combination of fiscal and monetary policies. By such 
a combination we mean a set of government plans that, given existing 
debts and projected real interest rates, 

(a) set primary surpluses through spending and tax plans 

(b) set the money supply (either directly or indirectly via plans for 
interest rates and prices). 

We can distinguish four types of plan: 

1. ‘Classical’ monetary policy leadership: the money supply is set 
implying a price level sequence (or alternatively an interest rate 
rule targeting inflation or the money growth rate from a given ini¬ 
tial price level: this implies a determinate money supply sequence). 
Fiscal policy sets surpluses required to validate the real debts given 
this price sequence. 

2. ‘Sargent-Wallace fiscal leadership’ (discussed further below under 
‘selfish overlapping generations’): here the current money supply 
is set to fix current prices but fiscal policy implies deficits which 
require a further inflation (and implied price level) sequence to 
validate the debt, so defining the future money supply sequence 
(or equivalent interest rate rule alternative as in type 1 policy). 
Both policy types 1 and 2 are ‘Ricardian’ in terms of the fiscal 
theory. 

3. ‘Non-Ricardian’ fiscal leadership: fiscal policy implies deficits and 
an inflation path is set via future money supply growth (or im¬ 
plicit future interest rate policy); to validate the debt a current 
price level rise is required, implying a current money supply jump 
(or equivalent current interest rate policy). Hence type 3 is to 
be understood as fiscal leadership like type 2, only whereas 2 fixes 
current prices via monetary policy, 3 fixes future inflation via mon¬ 
etary policy; thus in 2 future monetary policy must reflect fiscal 
requirements whereas in 3 current monetary policy must do so via 
a jump in the levels of both money and prices. 

In none of the above cases does the government plan a default 
which leaves: 

4. ‘Planned default’: here fiscal policy implies deficits while monetary 
policy fixes a price path that cannot validate the debt. The con¬ 
sequence is that the debt is automatically devalued by a fall in its 
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own price, D t . This is the case of deliberate outright default with¬ 
out resort to ‘indirect default’ via price-devaluation (3) or inflation 
tax (2). 

(l)-(4) is thus a classification of possible fiscal-monetary strategies. 
The ‘fiscal theory’ can then be thought of as drawing attention to 3. The 
essential point is that behind monetary policy lies a fiscal constraint: the 
government in the end has to plan how this will be met and the monetary 
(and hence price) consequences, present and future, are one part of this. 


MODELS OF FISCAL EFFECT 

The Ricardian equivalence model 

In considering how fiscal policy might affect the economy, we begin with 
the model of ‘Ricardian equivalence’, named after Ricardo, who sug¬ 
gested the model without really believing it. If we assume that house¬ 
holds have an infinite life, that all taxes are lump-sum (so creating no 
distortions), and that markets are complete (no unrealised opportunities 
for insurance or lending exist), then household consumption will depend, 
through intertemporal optimization, on the real rate of interest and per¬ 
manent income. It follows that government spending plans will require 
a certain permanent level of taxation-plus-seigniorage in line with our 
previous discussion. Let us hold seigniorage constant as it is to do with 
monetary policy. If the government varies the pattern of taxation be¬ 
tween periods, it will make no difference to this required permanent level. 
Cutting taxes in the present by TCt for example will simply imply more 
taxes later, by an amount equal in present value to the tax cuts: for the 
extra debt issued will be rolled up with interest until future extra taxes 
are raised to pay it off, say in t + N. When they are, the accumulated 
extra debt will be equal to TC t { 1 + r) t+N and its present value will 
therefore be this discounted by the same interest rate, thus simply TCt- 
This implies that households will treat the extra bonds they hold as not 
providing them with any extra wealth; thus government bonds are said 
to have no ‘wealth effect’. 

This is not to be confused with the idea that government spending 
changes have no effect: they will in principle. A higher permanent level of 
spending, the increase spread equally across periods, will raise permanent 
taxation and so lower permanent income by an equal amount. However, 
even Though they must reduce their permanent consumption by this 
amount, households may not reduce their spending in the current period 
by it; for example if interest rates are currently very high so that current 
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consumption is low, they may reduce consumption proportionately in all 
periods given the pattern of interest rates, which in turn implies only 
a small absolute cut in the current period. Hence there would be a 
‘balanced budget multiplier’ effect here, putting interest rates up still 
further. 

If government spending is increased temporarily, with permanent 
spending held constant, then households will not change their consump¬ 
tion at given interest rates, so implying a rise in aggregate demand which 
will put upward pressure on interest rates. So the pattern of government 
spending over time will affect demand and the economy and the per¬ 
manent level of government spending may also do so. The Ricardian 
Equivalence point applies to the pattern of taxation. 

Its implications are that fiscal policy, interpreted as bond-financed 
tax changes, has no effect on the economy. Since there are also no 
distortions to be affected as taxes are speeded up or deferred through 
borrowing changes, it is a matter of supreme indifference in this model 
whether the government runs deficits or not in financing its spending 
plans. The only element in fiscal policy that matters is the level of those 
plans and their pattern over time. However it is usual to define fiscal 
policy to exclude this (considering it as allocative or strategic policy) 
and to refer to the extent of deficits: on this definition fiscal policy is 
irrelevant to aggregate demand. 

Plainly too there is no relation in this model between deficits and 
monetary policy; a higher deficit requires no extra money supply growth 
to reduce the strain of financing it by bonds as there is no strain because 
private agents spend the same and so provide extra savings exactly equal 
to the required extra borrowing. Nor does a higher deficit increase in¬ 
terest rates, tightening monetary conditions (as in the IS-LM models 
we have used up to now). Monetary policy can be chosen entirely inde¬ 
pendently of fiscal policy (the only relation would be with the perma¬ 
nent level of taxes, which being lump-sum impose no distortions whereas 
seigniorage creates distortionary costs through money demand).The Ri¬ 
cardian assumptions can of course be relaxed, to dilute this harsh impli¬ 
cation of a zero wealth effect. First, if taxes are not lump-sum, then a 
change in their intertemporal pattern will cause distortions, which will 
have real effects. We will discuss below (model (c)) how this alters the 
situation. 

Secondly, we may appeal to various failures in the ‘complete markets’ 
assumption. These are not likely to have a large effect quantitatively; 
but one example is given in Box 7.1. 
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Box: 7.1 

AN EXAMPLE OF MARKET INCOMPLETENESS: 
INCOMPLETE INSURANCE. 

Suppose each individual faces considerable uncertainty about how 
much tax he and his heirs will pay; he does not know future income, 
future family size, the possibility of emigration, etc. Let us ignore 
emigration, in which case the change in tax prospects will have no 
impact on him; this no doubt affects only a minority. Let us assume 
that the tax system, as is typically the case in Western economies, 
is progressive; also assume that people are identical and risk averse. 
Suppose everyone receives the same transfer and holds it in the 
form of bonds. Then the individual will perceive his and his heirs’ 
potential net income after tax as follows: at the one extreme they 
will be poor, receive the bond interest, but pay little tax, from there 
progressing with higher income towards the other extreme where 
they will be rich, receive the bond interest, and pay a lot of tax. If 
taxes are raised they will pay little more if poor, but significantly 
more if rich. This effect of a high covariance between tax and income 
is illustrated in figure 7.1 (for a simplified distribution with only 
‘poor’ and ‘rich’ states of equal probability). 

We are interested in the change in the ‘certainty equivalent’ of each 
man’s future net income after tax, by this we mean the sure income 
that would yield him the same utility as the income possibilities he 
actually faces. Before the bond transfer and consequent future tax 
liability, let him have an expected net income of Ey, the average of 
his ‘poor’ and ‘rich’ states; the expected utility of this is EUq and 
the certainty equivalent is yo- After the bond transfer, everyone’s 
expected net income remains the same because the tax payments 
averaged across the two states must equal the bond interest receipts 
for the government’s budget to balance in the future. But now each 
person will be better off than before when poor (he received the 
bond interest but his extra tax burden is less than this), and worse 
off than before when rich (his extra tax burden exceeds the bond 
interest); the arrowed line on figure 7.1 joins these two states, and, 
because of the insurance he receives in effect, his expected utility 
rises to EU\ and his certainty equivalent income to y\ so that the 
bond transfer 
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increases private wealth. Barro (1974) has further pointed out that, 
if the tax rate is raised when taxes go up to pay for the bond interest, 
then this increases the progressiveness of the tax system and this 
insurance effect is enhanced, as illustrated by the dashed line with 
higher expected utility EU 2 and certainty equivalent income j /2 - 
Clearly this insurance effect of bond issue would be eliminated if the 
private insurance market already provided full insurance. However, 
this is unlikely because of incentive incompatibility (see e.g. Hart, 
1983); full insurance gives the insured person an incentive to lie 
about his poverty or fraudulently to avoid trying to be rich. 


rich 



Figure 7.1: Certainty equivalent and expected net income after tax 

Thirdly, we may drop the assumption of infinite life; this in practice 
is the most important relaxation. Plainly current households’ members 
do not live for ever. But Barro (1974) has shown in a model of over¬ 
lapping generations that, if each person leaves a bequest, then everyone 
acts as if he is infinitely lived. The reason is that the transfer he receives 
diminishes the utility of his heirs while raising his current resources. Yet 
he has already decided, having left them a bequest, that his optimal 
course is to give them their previous utility at the expense of his current 





194 


Evaluating Government Policy 


resources; therefore he will raise his bequest in order to offset the gov¬ 
ernment’s action, in effect saving the whole of the transfer for his heirs. 
The transfer, by not changing his opportunity set, leaves his net wealth 
and behaviour unchanged. 

If he has not planned a bequest, then the level of utility he desires for 
his heirs cannot be established; the transfer may or may not push their 
utility below his desired level. Hence we cannot establish how much, if 
at all, he will offset the transfer by saving. 

It is of interest that the lack of bequests might not matter, because 
a ‘bequest’ could be negative (parents could receive transfers from their 
children) as well as positive, a continuous variable; hence the government 
transfer would always be offset. What matters is that one generation 
cares about the other, so that the government’s redistribution from one 
to the other is offset (accidental bequests, e.g. because death comes 
unexpectedly early, are beside the point). 

Barro’s argument can be shown formally by simply setting out each 
generation’s lifetime budget constraint. Suppose first of all that there is 
no government borrowing or taxation; then generation 1 has a life-time 
budget constraint viewed at t + 1 (when this generation is old) of 

Vt+i + Vt ( 1 + r ) = c t+i + 4 (! + r) + Qt+i (9) 

where qj +1 is its bequest to generation 2, made at the end of t +1 period; 
y = income, c ^consumption, both in real terms, and r is the real rate of 
interest (assumed constant over time in this base-case situation). Super¬ 
scripts denote the generation. Generation 2’s lifetime budget constraint 
viewed from period t + 1 is: 


Vt +2 
1 + r 


+ Vt +1 + Qt+i — 


r 2 

c t +2 
1 + 1 ' 


+ c t +1 


q ?+2 

1 + r 


( 10 ) 


Now let the government transfer the amount b t in t to generation 
1, then young, and announce the plan to tax generation 2 by T 2 +l = 
bt( 1 + r) when it is young, in t + 1, meanwhile offering bt bonds at t to 
the loan market. Then the two generations’ constraints become: 

(gen. 1) 


Vt +1 + Vti 1 + r ) + M 1 + r ) = cj +1 + cj(l + r) 

1 + Qt+1 

(11) 

(gen. 2) 



1 + r + + q t+i b t( 1 + r ) i +r +c ‘+i 

, q t+2 

1 + r 

(12) 


If generation 1 had already optimally planned a bequest designed to 
give generation 2 a certain lifetime consumption possibility while leaving 
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itself with a certain other one, then the government’s action, if not offset 
somehow, will raise its own consumption while reducing that of gener¬ 
ation 2, which is by implication suboptimal. However, if generation 1 
raises its bequest exactly by the amount of the transfer grossed up by 
interest received from putting it into savings, then both generations will 
be exactly as well off as it originally planned; hence the transfer is saved 
by generation 1, which thereby buys the bonds the government offers. 

The Model of Selfish Overlapping generations: 

Unpleasant Monetarist Arithmetic 

Whether generations are linked with each other in Barro’s sense is an 
empirical matter. It may vary across and within societies. It seems 
reasonable to explore a model which is at the other end of the spectrum 
from the Ricardian and assumes each generation cares only about itself. 
Such a model is for example used by Sargent and Wallace (1981) in a 
paper designed to investigate the upper limit of deficits when money 
supply growth is being held down by ‘monetarist’ policies. 

In their overlapping-generations (OLG) model, debt is held by the 
young, to be redeemed from them when old (the old will only wish to 
consume, in the absence of heirs). In such a model the government 
clearly cannot borrow more from the young than their current income. 
We will consider below (Chapter 12) the detailed workings of such a 
model. But plainly the pattern of taxation can have large effects in it; 
for example a tax cut financed by borrowing throughout the lives of the 
current generation, so that taxes are raised on the next generation, or 
perhaps indefinitely deferred, must raise the consumption of the current 
generation as it will raise its permanent income. 

Let us turn to the connection of fiscal with monetary policy. As we 
will see in Chapter 12, borrowing in an OLG model is limited by the 
capacity of the younger generation to lend; the old will not lend since 
they wish to consume all their wealth before they die. The young will 
lend as much as they can be induced to through higher interest rates; 
let us assume that the government is deterred from raising interest rates 
above some normal level, r, by the higher debt servicing costs this will 
imply. Then at this interest rate there will be some maximum amount 
the young are willing to lend: call this the government’s debt ceiling. 

Now consider how the ceiling on debt would affect a simple macro 
model. Suppose that GDP, N (also standing for the population by an 
appropriate choice of indices), grows at the rate n per year and that the 
real rate of interest, r, is constant and greater than n. Let H* = rate of 
growth of money, ' = per capita value, 7T = rate of inflation, P = price 
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level, bt = the value of one-year indexed bonds. Write the demand for 
high-powered money, H, in the quantity theory manner as: 


H t = hP t N t (13) 

so that fP t * = 7 r t + n. Suppose now that at time t = 1 new intentions 
are announced and carried out, for future fiscal and monetary policy; the 
announcement (fully believed) changes the present real value of bonds to 
6i as prices and interest rates react. There are then two phases of policy: 
‘transition’ from t = 2, ..., T— 1, and ‘terminal’ from t = T, ..., . During 
the transition phase policies may be different from their terminal phase 
when they must be in steady state. The government budget constraint 
is from t = 2 onwards: 

b t - b t -1 =d t + rbt-i - Ht (14) 

Ut 

Expressing (14) in per capita terms, dividing all though by N t , gives: 

bt _ dt H t -i H t — H t -\ (l+?’)6t_i . . 

Wt ~ N't ~ PtN t H t -i (1 + n)N t - 1 1 ; 

Using P t N t = (1 + Ht)P t -\N t -i , this becomes 

h H* 

b 't = 4 ~ Y+H* + + r _ n ) 6 't-1 ( 16 ) 

Since r > n this is an explosive difference equation if interest-exclu¬ 
sive deficits and monetary targets are pursued independently. Suppose 
that during transition a constant H* and d are chosen: these policies are 
monetarist so that H* is ‘low’ but fiscal policy is ‘expansionary’, so that 
df > ytjp ■ The set limit on this per capita stock of debt b ', discussed 
above, is now bound to be reached at some point. At the date when this 
occurs, b' t = b ', then policies have to change so as to ensure: 

h H* 

0 = b’ t+1 - b[=d' - TW + (r - n)b' t (17) 

that is, that real bonds do not change any more. However, the policies 
could have been chosen so as to change before this, so that b' t < b' at this 
point. In general, the terminal date T, for the switch to a sustainable 
policy with unchanging per capita debt can be chosen freely from t = 2 
onwards, so that Ht and dy, the terminal or steady state policies are 
governed by: 

h H* 

0 = b' T — b' T _ 1 = d' T — ——-jjr + ( r — n)b' T _ 1 (b' T < b 1 ) (18) 
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The point is that there is a trade-off between the transitional policies 
(including the length of time they are pursued) and the terminal policies 
because the transitional policies affect the terminal stock of debt: 

b' T - 1 = (1 +r-n) T ~ 2 b , 1 + ^(1 + r-n) l ~ 2 fd' - (19) 

i =2 ' ' 

The trade-off implies each of the following: 

1. If the government wishes to maintain a constant interest-exclusive 
deficit d! = d' Tl then the smaller is current (transitional) money 
supply growth, the larger will future money supply growth H* r 
have to be. Given fiscal ‘profligacy’, there is therefore a trade-off 
between current and future inflation. 

2. If the government wishes to maintain constant money supply growth 
H*, then the higher are the transitional deficits the larger are the 
future surpluses that will be required. Given monetary discipline, 
there is therefore a trade-off between current and future fiscal dis¬ 
cipline. 

The message is, in short, that tough monetary policies require tough 
fiscal policies (called by Sargent and Wallace ‘unpleasant monetarist 
arithmetic’). 

The role of long-dated nominal bonds 

We have so far neglected the term in b' 11 the value of real bonds after 
the policies are announced; implicitly we have suggested that b\ was 
quite small. This then allows scope for policy makers to choose between 
trade-offs 1 and 2 above. If, however, b[ is large, and close to then 
there is little scope for choice; the policy makers are forced to go rapidly 
to steady state fiscal-monetary policies, hard as these must be. 

If the government bonds are nominal and short-dated, then the reval¬ 
uation due to policies of lower inflation will be small unless the change is 
drastic; hence for example on a one-year nominal bond Iiq. the change in 
value will be —bo(H* — H* } e ) where H*^ is the money supply growth ex¬ 
pected before the policy change. If the bonds are indexed as we assumed 
above, then the revaluation will be nil (given the fixed real interest rate 
assumption). 

If the bonds are long-dated, the revaluation effect can be very large. 
For example, take a bond paying a fixed money amount, Mk. on ma¬ 
turity at t = K. The present value of this at t = 1 expected at t = 0 
was 6 q = where Rq = H q® + r — n was the nominal interest 
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rate at t = 0; the actual present value at t = 1 is then ( 1 + ^)a--i where 
R\ = H* k + r — n and H* K = H* + H^ (that is, a weighted 
average of the transitional and the terminal H*). Hence the unantici¬ 
pated capital revaluation on such a bond is —b 0 (K— 1)(H* K — Hq c ). For 
a credible anti-inflation policy where H* K = = n, this revaluation 

will be a K — 1 multiple of the terminal fall in inflation, and create the 
necessity for harsh fiscal discipline with much greater rapidity. 

Definitions of the ‘deficit’ 


Several definitions of the government ‘deficit’ are in use: inclusive or 
exclusive of debt interest and, if inclusive, inclusive of either nominal 
or real debt interest. We can re-express the restrictions placed on fiscal 
policy in terms of these different definitions. 

Consider a version of the government budget constraint expressed in 
nominal terms in the steady state: 


A H t + A B t = (g- r)P t N t + R t B t (20) 


B is the nominal market value of bonds (in steady state interest rates 
will not be changing), R is the market nominal interest rate, r the tax 
rate and g the share of government spending in GNP (both assumed 
constant); H. P, ir and N are respectively money supply, prices, inflation 
and GNP as before. 

From our previous analysis, we constrain the ratio of debt to GDP 
to a constant in steady state so that: 


A B t 
B t 


= 7r + n(the growth of nominal GNP) 


Equation (20) can be written as: 

A H t H t A B t B t B t „ 

+ ~^P^Nt ~ 9 ~ t+ RN t Rt 


So that using (21): 

AHt -r , 77 v 
—— =v[g-T + b(r-n)\ 


( 21 ) 


( 22 ) 


(23) 


where v is the equilibrium velocity of money and b the equilibrium ratio 
of debt to GNP. 

Equation (23) says that the steady state growth rate of money de¬ 
pends upon the ratio to GNP of the steady state deficit inclusive of 
real debt interest (sometimes called the ‘inflation-adjusted real deficit’), 
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minus an allowance for growth, bn. We can also note that since in 
equilibrium^^ = 7r + n, 

(tt + n)(H t + B t ) = R t B t + (g-T)P t N t (24) 


or 


A H t , R t B t + {g-r)P t N t PSBR t P t N t 

H t H t +B t P t N t Ht+Bt 

which says that money growth equals the public sector borrowing re¬ 
quirement (PSBR) to GNP ratio times the ‘velocity’ of government net 
financial liabilities (‘outside money’, discussed in the next section). 

Equations (23) and (25) are of course exactly equivalent, although 
one uses the inflation-adjusted deficit while the other uses the unad¬ 
justed deficit. However, when (23) is used to assess what fiscal policy 
must be used to validate a certain counter-inflationary monetary policy 
(e.g one to reduce A H t /H t to n from some high level), great care must 
be taken to include in b the effects of falling inflation and interest rates 
on the value of outstanding bonds; this adjustment can be very large as 
we saw above when a large proportion of these bonds are non-indexed 
and of long maturity, so that large cuts in the government deficit ex¬ 
cluding interest may be necessary. When (25) is used, the implications 
are more transparent since nominal debt interest will not change except 
for short maturity stocks, which are rolled over before inflation comes 
down. These remarks are relevant to the debate on the Thatcher govern¬ 
ment’s Medium Term Financial Strategy, which did not always carefully 
observe this point (for example, Buffer and Miller (1981) incorrectly ar¬ 
gued that the fiscal policies were ‘unnecessarily’ restrictive using a crude 
adjustment for current inflation on debt at current market value). 

Given the overall policy requirement of fiscal-monetary consistency, 
the application in any situation will be largely a question of what is 
politically feasible. This is particularly true of the short-run time path 
immediately on announcement. It may well be wise, for example, to cut 
public spending rapidly as money supply growth is cut, even though this 
implies a ‘real’ budget surplus (that is, inclusive of real debt interest) in 
the first few years because during this period debt interest on long-dated 
stock is still offset by high inflation. This real surplus will disappear as 
soon as inflation comes down, because the debt interest on long-dated 
stock will fall away only very slowly. Then, with the spending cuts done, 
the budget deficit will be at steady state levels and some of the debt 
revaluation (on the long-dated stock) will have been worked off by the 
previous real surpluses; this is an illustration of the previous section’s 
discussion. 
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A Neo-Keynesian model with wealth effects 

We can embed the logic of the OLG model above with its wealth effects 
of government bonds within a conventional IS-LM model of the type we 
used in earlier chapters; the advantage of doing so is that while the con¬ 
clusions of the OLG analysis follow also in this conventional model, it can 
additionally be used for analysis of short-term stabilization policy. To il¬ 
lustrate these points we represent the government budget constraint in a 
different but convenient way. Private sector net financial wealth consists, 
in a closed economy (which we continue to assume), of government bonds 
and high powered money (the ‘monetary base’, consisting of the notes 
and coins issue plus commercial banks’ deposits with the central Bank), 
that is, government net financial liabilities. These are known as ‘outside 
money’ (following an earlier literature — e.g. Patinkin, 1965; Metzler, 
1951; Gurley and Shaw, 1960). Bank deposits are ‘inside money’ in that 
banks are a private sector institution; their deposits are therefore both 
assets and liabilities of the private sector, cancelling out in net terms. 

Let / be the stock of government net financial liabilities (hence gov¬ 
ernment debt) in real terms (that is, deflated by the consumer price 
index). / will rise for two reasons: first, a government deficit will create 
new liabilities, and second, the existing stock of liabilities will be subject 
to capital gains, as the price of bonds rises or the consumer price index 
falls. We write: 

= ^ = ( qAR t + A p t ) (26) 

Jt -1 b 

where d is the (total, inclusive of debt interest) government deficit as 
a fraction of GDP, b is the ratio of government debt to GDP, q = is 
the proportionate response of long-term bond prices to the long-term 
interest rate (i?); and 8 = logf. The unit coefficient on p t (the log of 
the price level) reflects our assumption that all government liabilities are 
denominated in money terms. We will view (26) with 6, q held constant 
at some average value as an appropriate approximation. 

To (26) we add a relationship determining the supply of high-powered 
money (in logs), which we shall write as m (which we can also treat as 
total money, reflecting the convenient and conventional assumption that 
there is a fixed ‘money multiplier’ between the two). We write: 

$ 

A m t = —(d t — d) + Am + e t (27) 

b 

where d is the equilibrium (steady state) government deficit as a fraction 
of GDP, e is an error term, and Am is the equilibrium rate of growth of 
money. Equation (27) states that out of equilibrium money supply will 
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have an independent random component (to which we could add other 
independent temporary determinants of money if we wished) as well as a 
component responding to the temporary component of the deficit. Given 
(26), (27) implicitly also determines the supply of nominal bonds as the 
difference between nominal financial assets and the monetary base. 

Equation (27) focuses on two aspects of monetary policy with which 
we shall proceed to deal. First, how far does the equilibrium growth 
rate of money reflect the government deficit? Secondly how far should 
the money supply growth rate be varied (over the ‘cycle’) as budgetary 
financing needs change? In other words, what are the links between 
fiscal and monetary policy, first in, and second out, of steady state? 
This distinction is an important one in rational expectations models, as 
we shall see. 

We begin with behaviour out of steady state. 

Stability and Bond-Financed Deficits Out of Steady State 

One issue that was given great prominence from the early 1970s un¬ 
til the general acceptance of rational expectations was the possibility 
of instability in models with wealth effects, if budget deficits are bond- 
financed. This can be illustrated in a simple fixed-price IS-LM model 
without rational expectations (a log-linear adaptation of Blinder and 
Solow’s 1973 model). We use non-stochastic continuous time and ab¬ 
stract from the steady-state relationship between money and deficits by 


setting d = Am = 0: 



yt 

= k6 t - ar t + 4>dt 

(28) 

m t 

= P + Sy t - (3r t + yd t 

(29) 

e t 

d t 

= y-gh 
o 

(30) 

d t 

= 9 ~ry t 

(31) 

m t 

® , 

— ■=■ d+ 

b 

(32) 


~g , government expenditure as a fraction of GDP, includes debt interest: 
this formulation assumes that other expenditure is reduced as debt in¬ 
terest rises. If it were not, the instability under bond finance discussed 
below would be severely worsened. ry t measures marginal tax receipts as 
a fraction of GDP and hence r is the income elasticity of taxation minus 
one: initial average tax receipts are netted out of g. ' denotes the time 
derivatives. Because prices are fixed at p, r t is both the nominal and real 
interest rate. Equations (28) and (29) are the IS and LM curves with 
wealth effects; (30)-(32) are the budget constraint and money supply 
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relationship for this model. 

The model solves for y t and r t given 9 t . Using (32) and (30) gives 

rnt= <E> {6 t +q r t ) so that m t = d>(d t + qr t ) + K m , where K m is an 
arbitrary constant. Substituting for m* from this into (29) and for dt, 
from (31) into (28) yields the equations for y t and r t . Substituting the 
solution of them for y t and r t into (30) yields the equation of motion for 
6 t as: 

— = {kd — ay + $kq + <f>a) 

a _ b y _____ — 9 1331 

4 (1 + (/)t)(( 3 + qy) + S(a + qk) 4 

For stability we require (k/3 — ay + $kq + <E>a) > 0. Clearly if money 
supply is held constant regardless of the deficit, i.e $ = 0 (bond financed 
deficits), then we must have k/3 > ay, which raises the possibility of 
instability if there are relatively strong wealth effects in the LM curve. 
$ > 0 reduces the possibility; Blinder and Solow and others have accord¬ 
ingly advocated money-financed deficits as a means of avoiding possible 
instability. 

This instability is illustrated in Figure 7.2. In addition to the IS and 
LM curves we have drawn in a 'WW' curve, which is the equation of the 

budget constraint, (30), showing the level of output where 9t= 0, that 
is. there are no changes in wealth. For 0, we must have both d t = 0 

and rt= 0; r t = 0 automatically when 9 t = 0 because as we have seen, 
r t and yt (the intersection of the IS and LM curves) depend on 9 t and 
cease to move when 9t stops moving. This level is y = ^since from (31) 
d t = 0 at this point. To the right of the WW curve, 9 t is falling (and 
rising to the left). The IS curve shifts leftward as 9 t falls but the LM 
curve shifts rightwards. Instability under bond-financing (<E> = 0) occurs 
when the intersection moves rightwards (and down) that is, k(3 < ay, as 
the effect of 9 t on y t (from equations (28) and (29)) is 

k3 — ay 
f3 + aS 

This can be seen diagramatically by noting that the rightward shift 
of the LM is yO, the leftward shift of the IS is k9 ; but the flatter the IS 
curve (the higher a) the more the LM curve shift dominates the output 
movement, and vice versa the flatter the LM curve (the higher (3). 

We pointed out that there is additional instability if one assumes 
government expenditure is fixed and does not fall to offset debt interest. 
The interested reader can work this case out, using in place of (31): 

d t =g-ry t + rb9 t + br t 


(30') 
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Figure 7.2: Instability with bond-financed deficits 

where the last two terms approximate debt interest around some average 
interest rate, r, and an average financial asset to GDP ratio, b. The 
reader will find that the equivalent of (33) contains a number of extra 
positive terms in the numerator, increasing the chances of instability. 

Under rational expectations, however, this Blinder-Solow argument 
carries less force. To convert their model into a rational expectations 
form, it is sufficient to recognize that the valuation of financial assets 
is forward-looking, that is, it depends on expectations of future interest 
rates; for convenience, now use discrete time. Returning, for example to 
the model given by (28) to (32), replace (30) and (32) by: 

A 9 t = =d t - q(E t r t +1 - E t _m) (29') 

o 

A m t = id t (31') 

b 

If the model is now solved by the methods of chapter 2, we obtain 
(if we drive the forward root backwards) a second-order characteristic 
equation in which one of the roots (the inverted forward root) should 
be absolutely greater than unity for a unique stable (saddlepath) solu¬ 
tion (i.e. having both forward and backward roots stable). The roots 
involve all the coefficients and there is no general condition to ensure the 
saddlepath property. 
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If $ = 0, the characteristic equation x t + ax t ~ 1 + bxt -2 = 0 with 
roots oq, a 2 has: 

D + = (&/? — a/x) 
a = -(^+^) = -l + #t + (1 + 0T)(l] 

6 = = + + fr-)p] <34) 

where D = (1 + </>r)/3 + (5cr. 

Since 6 < 0, the roots cannot be complex, and at least one of the roots 
must be negative (alternating motion) and the other positive (monotonic 
motion) which is consistent with a saddlepath. For example, take the 
following parameter values, which approximate those of the Liverpool 
model of the UK: 

/? = 2, (f> = k = 6 = 1, t = 0.3, a = 0.5, q = 3, /x = 0, i = 2 

These give b = —1.03, a = 0.43. Hence the roots are <ti, a 2 = 
0.82, —1.25: the monotonic saddlepath (figure 7.3a). Had we found 
by contrast /t = 5 so that k/3 < we would have had a 1 , a 2 = 
1.015, —0.135; again a saddlepath, but this time with alternating motion 
(figure 7.3b). 

Interestingly, in the particular example here, the ka < a^i = 5) is 
actually more stable in that, although it is alternating, the absolute value 
of the stable root is much lower than in the case of /x = 0. Computer 
examination of a wide range of values for the parameters suggests that 
problems with saddlepath stability arise whether fc/3 > or < afj, . 

If k/3 > a/x, there is saddlepath stability when q or ^ are low (oth¬ 
erwise the roots have a tendency to be both less than unity in absolute 
value: a ‘non-uniqueness’ problem). If fc/3 < a/x, there is saddlepath 
stability when q is high (or in some cases when /x is high, even though q 
is low); otherwise the roots tend to be both greater than unity. The two 
typical cases of saddlepath stability are shown in figure 7.3 

We conclude this section negatively: there is no compelling reason 
within rational expectations models to believe that the cyclical or short- 
run component of monetary policy ought to be influenced by fiscal policy. 
The decision, for example, whether to pursue a constant money supply 
growth rate through the cycle even though the budget deficit will be mov¬ 
ing cyclically, can be taken on other grounds, notably those appropriate 
to stabilization policy. We now turn to the steady state component of 
the money supply rule, where the situation is quite different. 

Long-term monetary targets 

We now take the previous model, but abstract from short-run behaviour 
while reinstating the possibility of steady state inflation. We have in 
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(a) Monotonic saddlepath with Kp>mi (— >0 — > 0 1 ; A0t, Ah+i ha\e same sign 

’ ae 


(b) Alternating saddlepath with Kp<ap 


dy . Sr 

^ <u > ^ >u | ; Aft, Ah+i have opposite sign 


Figure 7.3: Two possible types of motion under rational expectations 


equilibrium (* values are equilibrium ones): 


y* = k6* — ar* + (j)d 

(35) 

m* =p*+ 6y* - (3R* + yd* 

(36) 

Ad* = ld- qAR* - Ap* 

(37) 

d = g-Ty* 

(38) 

R* = r* + A p* 

(39) 

Am* = Am 

(40) 


We assume y* is exogenously given as the natural rate in an (omitted) 
Phillips Curve. 

The question we wish to ask is: can A m be chosen independently of 
d? For simplicity assume a steady state growth rate of zero (Ay* = 0), 
although this does not affect the argument. Assume also that d and Am 
are chosen to be constants. 

The first thing to notice is that, if in steady state both real interest 
rates and inflation are constant, then at once we have A R* = 0 from 
(39), A 9* = f A r* =0 from (35); then from (36) and (40) we have 
Am = A?n* = A p*, while from (37) we have =d =A p*. It then follows 
that = d = Am; monetary and fiscal policy have to be ‘consistent’, that 
is the rate of money supply growth has to be equal to the deficit as a 
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fraction of financial assets, =d. This in turn is equal to the rate at which 
nominal financial assets are growing (AO* + A p*); hence if money is 
growing at this rate, so also are bonds. Consistent monetary and fiscal 
policy hence implies that money and nominal bonds must be growing at 
the same rate. 

What happens if they do not? Suppose for example we place a 
terminal condition on inflation reflecting a government inflation tar¬ 
get achieved via money supply growth. So inflation will be constant 
in steady state. We may then derive by a similar procedure to the one 
already used: 


A r* 


_ k _ 

k((3 + q) + a(l - /r) 


(=d — A to) 


(41) 


In other words, if the government (given its terminal condition on in¬ 
flation) fails to pursue consistent fiscal and monetary policy, real interest 
rates will take the strain and eventually either (rising r*) government ex¬ 
penditure would have to contract to zero (taxes rise to absorb the whole 
of GNP) or (falling r*) real interest rates would become negative — nei¬ 
ther of which is possible. Of course a sensible government will wish to 
stop any such tendency well before any such stage is threatened. 

This argument has been conducted on the assumption that d is set, 
with changing interest payments on debt being offset by changes in gov¬ 
ernment spending or taxes; if one assumes instead that government 
spending and tax rates are unaltered, then the steady state inflation 
rate under consistent policy, =d = Am, depends on the level of govern¬ 
ment spending and taxes chosen; the algebra of this is more complex 
(see Minford et al., 1980), and the inflation resulting from any initial 
rise in the deficit is much greater than in the analysis above, because the 
eventual deficit is compounded by the rise in interest payments, but the 
essential message remains that there must be consistency between fiscal 
and monetary policy in steady state. 

To conclude, wealth effects imply a constraint across the steady state 
components of fiscal and monetary policy, though not the short-run com¬ 
ponents. Hence in (27) A m = =d, whereas we are free to write any 
short-run response function or error process besides. What we have 
done in this section is to show within a conventional IS-LM model with 
wealth effects essentially the same result as derived in the overlapping 
generations framework of Sargent and Wallace — that there are close 
connections between fiscal and monetary policy choices in the long run, 
but that in the short run there is considerable flexibility in their rela¬ 
tionship. 
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A model with distortionary taxation: the optimal pattern of 
borrowing 

We have seen that if a government wishes to achieve a certain money 
supply growth rate (to reach a desired inflation rate), then it is limited 
in the steady state (or ‘average’) deficit it can pursue at the same time. 
Nevertheless, provided it is willing to make up temporary deficits in 
excess of this with future deficits that fall short of it (even running 
into surpluses), then as we have seen it can still achieve its monetary 
objectives. The question we now ask is: what is a desirable pattern of 
such temporary deficits and surpluses? 

In chapter 4 we looked at this issue from the point of view of sta¬ 
bilization policy (‘demand management’). We concluded first, that tax 
rates acted as an effective automatic stabilizer and that this was de¬ 
sirable, if the tax rates were an unavoidable microeconomic distortion 
and if unemployment was distorted upwards in cyclical troughs by the 
benefit system; secondly that activist, ‘feedback’, fiscal policy was also 
effective and could also be beneficial on similar grounds. But we now 
consider the issue abstracting from the business cycle and stabilization 
policy. We use the criterion of optimal public finance alone: that is, we 
consider the distortionary costs of taxation. In order to do this, we shall 
drop the assumption that taxes are lump sum, and assume instead that 
each period there is an average (= marginal) tax rate, T t . Such distor¬ 
tionary taxation implies that even in an economy with infinitely-lived 
households (or where each generation cares about the others) the pat¬ 
tern of deficits and borrowing affects consumer surplus and so private 
wealth, by creating variation in the extent of the tax distortions over 
time and hence in their present discounted value. It turns out that there 
is an optimum pattern which minimizes this present discounted loss of 
surplus. 

Suppose that output, N, is (apart from temporary effects of tax rate 
variation) continuously at IV, so that stabilization is not of interest, 
and for simplicity suppose further that N is constant, real government 
spending is constant at G, and money supply growth is constant at fi. 
As in the Sargent-Wallace example, let the demand for money be given 
by the quantity theory as: 

H t = hP t N (42) 

and the budget constraint be given by: 

bt - bt-i = G - T t (l - e )N + rbt -1 - Ht ~ J ^ (43) 

“t 

where bt are indexed one-period bonds with a constant real interest rate, 
r, as at the start of this chapter and note that T t ( 1 — e)N deducts 
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tax revenue lost as output responds to the tax rate with an elasticity e 
(assumed constant). We can rewrite the budget constraint, following (4) 
as: 


bt-i = 


1 + r 


E 

,*=o 


1 + r 


T t+i (l - e)N - G + 


hNji \ 

1 + /v 


hN n /-y 
1+M ^ 


E 

»=o 


1 + r 


i +1 


T t+i (l-e)JV (44) 


This says that the outstanding value of bonds must be equal to the 
present value of taxation less the present value of government spending 
(net of inflation tax revenue). This is the constraint on the pattern and 
total of taxation — due to the solvency constraint. 

Now examine the distortion costs of taxation. By the usual consumer 
surplus triangle analysis, we can write the present value of these costs 
at the end of t — 1, C t ~ i, as: 


C t -1 



(45) 


The optimal tax rates are discovered by minimizing C*_i subject to 
the tax constraint. Form the Lagrangean: 

OO 

L = C t -1 + m[K— Y 

i—0 


(ttt) < 4li > 


G— - 


where K = b t -1 H- ■ The first-order condition is: 

dL 


0 = 


m+i 


= T t+i e - m{ 1 - e) 


Since e and m are fixed constants, this yields 

r, + . = r = ld^E 


(47) 


(48) 


This result, due to Lucas and Stokey (1983), extends Ramsey’s (1927) 
rule that commodity tax rates should be inversely proportional to their 
demand elasticities. In other words, the optimal tax rate, T, is a constant 
if the output elasticity is constant over time. We can work out what it 
must be by solving for T t +i = T in the taxation constraint as: 


A 1 - e) 


rb t -1 G _ h/j, 
N + N 1 + n 


(49) 
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The constant tax yield is that which will pay the interest on the 
outstanding stock of debt plus the government spending bill net of the 
inflation tax. Hence the stock of debt is held constant by optimal tax¬ 
ation. (If GDP was growing and government spending constant as a 
fraction of it, then the constant tax rate formula would imply that debt 
would also be constant as a fraction of GDP. The reader may like to 
rework the optimal tax rate problem with GDP growing at the rate n, 
using the budget constraint formulation of equation (16). He should find 
that the steps are identical, except that r — n is substituted for r and 
bonds and government spending are expressed as fractions of GDP; the 
optimal tax rate remains constant.) 

To put optimal tax rates another way, remember that optimality 
requires the intertemporal rate of transformation between revenues to 
equal that between welfare costs; that is, 



where R t -i =Y1 (TTF) 1+1 2Lh(1 — e)iV is the present value of revenues. 
t =o 

In words this states that the (discounted) marginal cost per unit of (dis¬ 
counted) revenue gained from raising the tax rate in period t + i must 
equal that from raising the tax rate in period t + i + 1. The former 
(LHS) is the latter (RHS) is — i n each the numerator is 

the contemporaneous welfare cost, the denominator the contemporane¬ 
ous tax yield, of a penny rise in the tax rate. Notice that since both the 
welfare cost and the tax yield are discounted, the discount factor drops 
out. 

Figure 7.4 illustrates this. It is assumed that workers have a constant 
marginal product MPL. The supply curve, SS, shows the output they 
produce as their marginal real wage, w, rises, increasing their hours 
input into the overall production function. The tax rate, T, depresses 
their marginal take home pay by Tw. The triangles of consumer surplus 
are minimized in total area when they are equal: as one increases from 
this point, the other decreases by less because their area depends on the 
square of the tax rate. The height of the triangle is measured by Tt+i, 
which is also a measure of the loss per unit of extra revenue raised. 


CONCLUSIONS 

We have considered the role of fiscal policy in three basic models: with 
Ricardian equivalence (where there are complete markets, infinitely-lived 
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(a) Equal tax rates 



(b) Unequal tax rates 



consumer surplus loss 


revenue 


Figure 7.4: The optimality of a constant tax rate 


households and non-distorting, i.e. lump-sum, taxes), and then in two 
main cases without it, the first with overlapping generations with finite 
life that do not care about each other, and the second where they do 
(and so behave like infinitely-lived households) but there are distorting 
taxes. 

In the first Ricardian equivalence model the pattern of taxes and 
borrowing does not matter. The only constraint on the government 
is its solvency; if it violates this condition with its tax, spending and 
monetary policies then the markets will mark its debt down in value until 
the condition is met (in other words the debt value factors in expected 
default). Monetary policy sets the rate of inflation and should if properly 
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conducted also fix the price level. Fiscal policy then must set taxes and 
spending so that solvency is ensured. 

In the second model with unconnected overlapping generations we 
saw that there are important links between fiscal and monetary policy 
due to the fact that patterns of public borrowing affect the current gen¬ 
eration’s wealth. These links do not involve short-run behaviour, that 
is, over the economic cycle; independent fiscal and monetary responses 
to the cycle do not in general cause instability (or non-uniqueness) un¬ 
der rational expectations. By contrast, with adaptive expectations and 
wealth effects, coordination may be necessary even in the short run to 
avoid instability. It is in steady state that fiscal and monetary policy 
must be ‘consistent’, that is, the growth rate of real government bonds 
must be reduced to the rate of growth of GDP (assuming this is the rate 
at which the demand for bonds will grow in steady state at constant real 
interest rates). 

In the final model, with distortionary taxation and infinitely-lived 
agents, the pattern of borrowing and taxation affects consumer surplus 
(and so private wealth) and we find that the optimum tax rate that 
minimises this loss of surplus is planned to be constant over time from 
the present state, given past and current shocks (‘tax-smoothing’). As 
new shocks arrive, the tax rate adjusts to maintain the same future 
constancy, with borrowing taking up the resulting difference between 
taxes and spending. 

So, summarising, we can say that given the choice of monetary targets 
to set prices fiscal policy is significantly constrained, to achieve solvency, 
to ensure long-run consistency between debt and money holdings, and 
to achieve optimality in tax patterns. We have moved a long way from 
the models of the 1970s where fiscal policy could be freely set, the only 
result being ‘crowding out’ via interest rate movements. 



8 

The political economy of democracy 


One of the major developments in the public choice literature in the 
past two decades has been the construction and empirical testing of 
models that consider the interaction of the preferences of government 
and electors and the behaviour of the economy. 

Important early examples of this work are the papers by Nordhaus 
(1975), MacRae (1977) and Frey and Schneider (1978a, b). A key as¬ 
sumption of this work is that expectations are formed in an error learning 
or adaptive manner. The purpose of this chapter is to consider the impli¬ 
cations that the rational expectations hypothesis has for the behaviour 
or role of the authorities; we assume that both the authorities and the 
voters have rational expectations. This raises the interesting issues of 
whether there is scope for differences in economic policy between differ¬ 
ent political parties and of how voters will react to differences in policy 
and economic performance. 

We begin by outlining the early work in the public choice literature, 
which is based on adaptive expectations. We then consider the rational 
expectations alternative. 


THE NORDHAUS MODEL 

Perhaps the most interesting model of the adaptive expectations vintage 
is Nordhaus (1975). The Nordhaus model is based on the Downs (1957) 
hypothesis that governments have an over-riding goal of winning the next 
election. Hence they obey the ‘median voter theorem’ (namely, that their 
policies will be designed with maximum appeal to the floating voter who 
will decide the election) and are consequently concerned to maximize 
their popularity over the period of office. This has the implication that 
both parties will offer identical policies, since they will be driven by 
competition for the median voter to offer a policy which will win him or 
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her over: identical policy will give each party an equal chance of winning, 
which is the equilibrium. 

Their popularity is assumed to depend on a number of key economic 
variables, notably the rate of inflation and unemployment. It is further 
assumed that voters, in evaluating the history of inflation and unemploy¬ 
ment, give highest weight when voting to the current rates of both then 
current. The government is assumed to maximize its popularity subject 
to the constraint that the rate of inflation, the rate of unemployment and 
the expected rate of inflation are linked via an expectations-augmented 
Phillips curve. Finally it is assumed that price expectations are formed 
adaptively. The Nordhaus model thus has the following mathematical 
structure (in continuous time). 

Maximize 

P= f G(u,p){\ +r) t (1) 

Jo 

subject to the constraints: 

p = g(u)+p e (2) 

Dp e = <t>(p - p e ) (3) 

where p is the rate of inflation (p e the expected rate), u is the rate of 
unemployment, r is the discount rate ( positive to reflect voter myopia), 
G and g are functions, <f> is a positive constant, and D is the differential 
(rate of change) operator such that, e.g. Dx = dx/dt. Equation (1) is 
the government’s objective function. 

Popularity, P, is maximized between the time of arrival in office (0) 
and the time of the next election (T). Equation (2) is the augmented 
Phillips curve and equation (3) the adaptive expectations mechanism. 

Clearly the optimal paths of unemployment and inflation will depend 
on the precise choice of functions G(.) and g(.). Nordhaus specifies an 
objective function: 

G(.) = -ap - (4) 

and a linear augmented Phillips curve: 

g(u) = a — bu (5) 

where a, (3, a, b are positive constants. 

The mathematics required to solve this problem is outside the scope 
of this book (see e.g. Cass and Shell, 1976). It turns out that the path 
of the unemployment rate between elections implied by its solution has 
the form shown in figure 8.1. 
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Figure 8.1: The Political Business Cycle 


The essential implication then of this work is that governments delib¬ 
erately cause a business cycle so that, at the date of election, they are in 
the most favourable position with respect to voters’ preferences; hence 
on arriving in office, they raise unemployment to initiate a reduction in 
inflation, then after two years or so they stimulate the economy to re¬ 
duce unemployment in time for the election date, leaving their successor 
to cope with the rise in inflation that is the lagged result of this policy. 

While the optimal pattern of inflation or unemployment does depend 
on the precise choice of functional forms, the essential insight of Nord- 
haus, that governments may have a vested interest in creating business 
cycles, will survive these and other changes (such as the length of the 
electoral period T; see Chappell and Peel, 1979). The assumption that 
government popularity depends on variables such as the rate of inflation 
or unemployment has received some empirical support. In particular 
Frey and Schneider (1978a, b) have reported empirical work for a va¬ 
riety of different countries, such as the United Kingdom, the USA and 
West Germany, in which a measure of government popularity, typically 
based on opinion polls, is found to be significantly related to such eco¬ 
nomic variables. 

Subsequent work (e.g. Borooah and Van der Ploeg, 1982; Chrys- 
tal and Alt, 1981; Pissarides, 1980; Minford and Peel 1982a; Harte et 
al., 1983, Gartner, 1994), showed that the work of Frey and Schneider 
is not statistically robust with respect to changes in the sample period 
or the economic variables chosen. This is perhaps not too surprising. 
Economists, when analysing agents’ choice between alternatives, are con¬ 
cerned to stress the appropriate relative prices. This point has typically 
been neglected in the empirical work on government popularity. The 
alternative hypothesis is best outlined by Tullock (1976), who writes: 
‘Voters and customers are essentially the same people. Mr. Smith buys 
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and votes; he is the same man in the supermarket and in the voting 
booth. There is no strong reason to believe his behaviour is radically 
different in the two environments. We assume that in both he will choose 
the product or candidate he thinks is the best bargain for him.’ 

When we also recognize that voters are expressing preferences for 
different policies over the life-time of a government, it is clear that the 
relevant choice should reflect expectations of future policy differences 
between parties. From this perspective the conventional work is implic¬ 
itly assuming that expectations of the future paths of economic variables 
under the party in power are formed adaptively and that the paths of 
economic variables under the alternative parties are considered by elec¬ 
tors as fixed. 


RATIONAL EXPECTATIONS AND REPUTATION 

It is possible to motivate a political business cycle by reference to the 
new literature on reputation and partial information, discussed in earlier 
chapters. Suppose that before the run-up to the election, the government 
has acquired a good reputation for monetary prudence and that there is 
partial information about current events, with people observing current 
economic activity but not observing current policy variables, such as the 
money supply and government spending. Votes are cast according to 
people’s assessment of the equilibrium levels of output and inflation as 
rationally expected, but they have a signal extraction problem. 

In this vein, let election popularity (the voting balance) be given by: 

P t = —{Ey t — ky t _ x ) 2 — f(Eg t — n*) 2 (6) 

where y is equilibrium output (k > 1 reflects the people’s desire for a 
better supply side), and g is the equilibrium level of inflation and also 
of policy levers (money supply growth and the fiscal deficit behind it). 
Let 


Vt=yt-i+£t ( 7 ) 

e t is random, partly the result of past policies whose fruits cannot be 
accurately forecast. Actual output (observed) is given by a Phillips curve 
where g\ is the people’s expectations of government policy based on its 
reputation: 

yt = y t + 0(g t - gt) (8) 

But 


9t = 9t + m t 


(9) 
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where m* is the unforeseen policy change unobserved by the public but 
known by government. 

The public’s signal extraction gives 

Ey t =V t -i + <l>{yt - y t -i) = Vt-i + fat + %t - 9t )] (10) 


Egt = g e t + Em t 


9t 


(i- 4>){yt -y t -i) 

6 


9t + ~—7 r~^( e t + 0(g t - 9t)) 


( 11 ) 


The government now maximizes P t (subject to gf, et, given) 
with respect to gt, in the election run-up by setting 0 = ^-and this 
first-order condition yields 


Q(fc - faVt-i - (1 - faf(9t_ - fa - [</> 2 V + f(1 / ) M 
o 2 (r /(1 o) 2 


+ 9t 
( 12 ) 


g responds positively to target output ( ky t _ 1 ), and positively to a bad 
supply shock (negative e t ). although it is restrained by ‘bad reputation’, 
high g ®. So there is a tendency for monetary policy to expand pre¬ 
election, especially if the supply side is bad. 

The government would in this model aim to build up its reputation 
after an election; then in the run-up it would ‘use up’ some of this 
reputation. Clearly there are strict limits to the policy boost that can 
be delivered in this way; these fall far short of the crude swings in policy 
optimal under the Nordhaus model. Nevertheless, there is still a basis 
here for a political business cycle under rational expectations. Models 
of a similar structure are to be found in Cukierman and Meltzer (1986 
a, b), Alesina and Sachs (1988) and Alesina et al. (1997) for example. 

The intuition behind this and other such models can be appreciated 
if one asks: would any government like to go into an election during a 
recession? Clearly no one can be sure of what is truly the reason for 
the economy’s bad performance; and some blame is bound to rub off 
on the government. Yet the very transparency of this motivation could 
undermine even this limited political business cycle model. Suppose that 
everyone knows the government will maximize Pt in the run up to an 
election. Then reputation-building is useless and the policy unravels. 

First, take the model as at present set up. People would form their 
expectation gf not by observing past actions but by taking expectations 
of (12) and setting E t _ig t = g The result is 


9l = 7T* + 


<f>(k-l)9_ 

THF" 


Vt -1 


(13) 
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When the government comes to maximize (6) knowing et but initially 
supposing the public does not know it, it would choose in general: 


9t = 9t + Q£t (14) 

But the public will know that this is the government’s current policy 
because it knows the government’s preferences. Therefore it will know 
that 


Vt = Vt-1 + (1 + (15) 

where q is the government’s known response to e<. So the public will in 
fact be able to infer the value of et and so Eg t = gt, Ey t = y t . 

(The only case where this will not be so is if q = — i ; in this case the 
public will be unable to judge the size of the shock at all because the 
government is deliberately stopping output from reacting to the supply- 
side shock at all. However, should the government choose this then the 
public will know that, although there will in general have been some 
shock to the supply side they are unable to know what it is; their best 
guess in this case is any value at all, a completely flat distribution. We 
should assume therefore that the popularity function is undefined at this 
point and by implication worse than one that is defined.) 

This being so, the whole problem changes. Now there is no signal 
extraction problem; instead, 

p t = -(yt-ky t -i) 2 - f(gt-n*) 2 (16) 

The government’s maximizing strategy is simple: gt = 7r*. There 
is no political business cycle again: straightforward honesty is the best 
policy. 

Clearly this argument implies an extraordinarily perceptive voting 
public and clear knowledge of the government’s motivation. Both must 
be in doubt. Given the paradox of voting (see next section), let alone 
the costs of investing in good information extraction as a voter, there are 
reasons to doubt such rationality. Government motives may be obscure, 
given the ideological factor; or they may deliberately be obscured as in 
the Backus and Driffill (1985 a, b) model. 

Nevertheless, this case of unravelling at least serves to show that full 
rational expectations with known government objectives does remove the 
political business cycle. It provides a useful benchmark, and cautions 
that the political business cycle, at least if practised too vigorously, is 
likely to self-destruct. 

We now turn to the central issues of what might determine voting 
and party policies if voters have this full (benchmark) rationality. 



218 


Evaluating Government Policy 


A RATIONAL EXPECTATIONS MODEL OF VOTERS 
AND PARTIES 

We proposed a model (Minford and Peel, 1982a) in which the expecta¬ 
tions of voters and the authorities are both assumed to be rational. We 
argue that the marginal costs of information gathering can be regarded 
as sufficiently low for the representative voter to develop an informed 
opinion of the future path of economic variables. One simple mecha¬ 
nism (and there are probably some others) by which this can occur is 
via public forecasts. Forecasts of inflation and output from forecasting 
groups (for instance, in the UK the Treasury, the National Institute of 
Economic and Social Research, and the Liverpool Research Group, be¬ 
sides a plethora of City of London institutions) will represent informed 
opinion and are given widespread publicity by newspapers and television, 
which the voters obtain at negligible marginal cost. While these public 
forecasts will differ to some extent, they will tend to be correlated and 
voters’ expectations, as conditioned by them, will more closely approxi¬ 
mate rational expectations than some mechanistic adaptive alternative. 

Although the typical voter may face low costs of gathering informa¬ 
tion and opinions this does not explain why he votes. The direct marginal 
benefits of voting, in the sense that an individual vote will influence the 
electoral outcome, appear a priori to be less than the marginal costs of 
voting. Indeed, the ‘paradox 0 f voting’ is that, because the effect of one 
vote on the election outcome is negligible, the voter obtains no expected 
marginal gain from voting. However, the most attractive rationale for 
rational voting is the ‘civic’ recognition by voters that democracy cannot 
function unless many people vote (see Mueller, 1979, for a fuller discus¬ 
sion of these issues). In game-theoretic terms, democratic behaviour is a 
game with rules, one set of which governs voting; such games evolve from 
social discovery processes (for a discussion of such evolutionary processes 
see Sugden, 1986). Voters are expected to vote according to their own 
preferences; since most have only the vaguest idea of the ‘nation’s good’ 
and little ability to evaluate it, they are expected to evaluate the effect of 
party policies on their own individual or household interests, where they 
should have a keen and accurate perception. Then the electoral process 
aggregates these votes into a popular preference for one party’s policies. 
A number of authors (e.g. Meltzer and Richard, 1981, 1983) have em¬ 
phasized that such ‘voting with one’s pocket’ opens up the possibility of 
‘rent-seeking’ by voter coalitions with a vested interest: politicians then 
have to weigh up their ability to attract (organized) votes from vested 
interests against their chance of appealing to the (disorganized) votes of 
the ordinary voters. We return below to the effects this choice may have 
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on party platforms. 

Controversy, into which we cannot enter here, surrounds what sort 
of process delivers the better results. Anglo-Saxon systems of first-past- 
the-post deliver strong mandates, which notably Popper (1988, 1945/66) 
has defended as providing a strong capacity of electorates to get rid of 
governments. Continental systems of proportional representation deliver 
governments which can only survive by consensus between coalition part¬ 
ners. The Anglo-Saxon systems have produced large-scale swings of pol¬ 
icy, with ‘experiments’ on a grand scale. The continental systems have 
produced a slowly changing compromise set of policies, which never be¬ 
came as socialist as, say, the UK when socialism was fashionable in the 
1960s but neither moved so rapidly towards deregulation and free mar¬ 
kets as the socialist tide receded in the 1980s. 

Given our assumptions about the information set of agents, we follow 
a number of different authors in supposing that political parties in part 
pursue economic policies which are broadly in accordance with the objec¬ 
tive economic interests and subjective preferences of their ‘class’, defined 
as their core political constituency. For instance, Johnson (1968) writes: 
‘From one important view, indeed, the avoidance of inflation and the 
maintenance of full employment can most usefully be regarded as con¬ 
flicting class interests of the bourgeois and proletariat, respectively, the 
conflict being resolvable only by the test of relative political power in 
the society.’ 

Robinson (1937) also writes: ‘In so far as stable prices are regarded 
as desirable for their own sake, as contributing to social justice, it must 
be recognised that justice to the rentier can be achieved only by means 
of the injustice to the rest of the community of maintaining a lower 
level of effective demand than might otherwise be achieved. We are 
here presented with a conflict of interests...and actual policies are largely 
governed by the rival influences of the interests involved.’ 

While clearly we do not accept the assumption made in both these 
comments that output is demand-determined, they do usefully highlight 
the importance of rival organised interest groups in the political process. 


VOTERS 

We assume that there are three relevant sets of voters: Labour (Demo¬ 
crats), Conservatives (Republicans) and floating voters. The supporters 
of each party come from different parts of the electorate (for example, 
‘labourers’ and ‘capitalists’, see Hibbs, 1978). The stylized assumption is 
that Labour voters primarily hold human capital and the Conservatives 
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primarily financial capital, while floating voters have large amounts of 
both. 

The current utility function of the voters is written in quadratic form 
as: 


Ct = c\pt + c 2 p 2 + Co (Conservative) (17) 

V t = vipt + v 2 Pt + v 3 y t + V4Vt + *’o (Floating) (18) 

L t = hyt + hVt (Labour) (19) 

where ci, c 2 , v\, v 2 , v±, l 2 are negative and co, vq, v 3 , h positive 
constants; y is disposable labour income. 

The floating voter who determines the election outcome is assumed 
to express his voting intentions (up to and including the time he votes 
in the election) according to which party is expected to give him greater 
utility from the time of the next election onwards. Formally he takes the 
expectation E t Vr (which is taken as a proxy for his expected utility for 
all time beyond the election) conditional on each party’s policies in turn, 
EtVjS, EtVf' (Labour and Conservative respectively); he casts his vote 
for Labour if E t V^ > E t Vf and vice versa. In aggregate it is assumed 
that voters are distributed around the typical floating voter, yielding a 
cumulative voters’ balance function of the form: 

B t = b{E t Vfi - E t V°) + h t (20) 

where G denotes ‘government’ and O ‘opposition’; h t . is an error process 
for non-economic omitted variables. Taking expectations of (19) for the 
government yields: 

EtV? = V! E t p T + v 2 E t p 2 T + v 3 E t y T + v^Ety? (21) 

these being the expected outcomes (that is, under the current govern¬ 
ment). 

Doing the same (a ‘counterfactual’ expectation) for the opposition 
and subtracting from (21) gives: 

E t Vr - E t V° = MEtpr -p)+ P 2 (var T p - varp) + f3 3 {E t y T - y) 

+ P 4 (var T y- vary) (22) 

where varp, vary are the variances of p and y around their expected 
values and p, y, varp, vary reflect the relevant expectation and variances 
of the opposition. 
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Unlike previous voting functions, this formulation is explicitly for¬ 
ward-looking in inflation and income and it includes variances of the 
relevant economic variables. In their related empirical work Minford 
and Peel (1982a) use the perhaps unsatisfactory proxy of time trends 
for the opposition party’s policies and there is clearly scope for the use 
of more subtle alternatives. Using Gallup data for the United Kingdom 
over the period 1959-75, they produce evidence that (22) performs in a 
more satisfactory manner than the conventional Frey-Schneider alterna¬ 
tive. However, in later empirical work on the UK, West Germany and 
Sweden, Harte (1986) finds that these voting functions are as unstable 
as their non-rational predecessors. Similar instability is found, with al¬ 
ternative functions under rational expectations, by Borooah and Van 
der Ploeg (1982). This instability is not difficult to explain in terms 
of Lucas’s (1976) critique. These voting functions are, in spite of their 
forward-looking, terms ‘reduced-form’ expressions in which the effects of 
voter preferences, the model and the exogenous processes linking policy 
(in each party), as well as the economic environment, are jointly solved 
out. Even if preferences are unchanged, then other elements will change 
and shift the voting functions. The instability does not necessarily in¬ 
validate the theory but it should make us modest in our expectations of 
estimating it. 

A side implication of this approach is that voter preferences will grad¬ 
ually change with their economic interests. For example, ‘ embourgeoise- 
ment\ carrying with it the wider accumulation of non-human capital 
(including home and share ownership), will increase the size of the ‘cap¬ 
italist’ class relative to the ‘labour’ class. Strong evidence of this change 
has been produced for the UK (Crewe, 1988). There is also evidence in 
opinion poll data from 1987, at a time when Labour policies were still 
designed to appeal to traditional labour voters, that share ownership 
was significantly correlated with voting Tory; with privatization having 
greatly extended share ownership (from 8 to 25 per cent of the popula¬ 
tion), this could have been an important electoral factor. 


EMPIRICAL EVIDENCE ON POPULARITY 

The empirical evidence on the relationship between political popularity, 
as measured in opinion polls, and economic variables is, as we have seen, 
clearly suggestive that the relationships are unstable. Besides the Lucas 
critique, this instability could also be due to omission of key variables: 
for instance social issues such as health or education are omitted from 
the regressions, and could often be important. In addition variables that 
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measure the opposition’s policies are difficult to operationalise and as a 
consequence rarely formally considered in the econometric analyses, but 
again must be relevant to the voting decision. For these types of reasons 
it is perhaps not surprising that the estimated functions display such 
marked instability. Recent work by Box-Steffensmeier and Smith (1996), 
Byers, Davidson and Peel (1997, 2000) points to a different statistical 
reason for the instability. Employing popularity data for a large number 
of different countries and political parties, they obtained the striking 
result that popularity could be parsimoniously modelled as a fractionally 
integrated process, ARFIMA process (Granger and Joyeaux, 1980) — see 
the Time-Series Annex at the end of the book for statistical background 
to this and the following discussion. 

The ARFIMA (jp, d, q) process is given by: 

x t = (1 - L)ut (23) 

where u t is a stationary ARMA (p, q ) process, and d is non-integer. 

ARFIMA processes have the property that the autocorrelation func¬ 
tion exhibits hyperbolic decay rather than the eventually exponential 
decay of the stationary ARMA, or 1(0), process, and enables ARFIMA 
processes to model dependence between observations at long range. But 
at the same time and in contrast with the 1(1) or unit root case, these 
processes exhibit eventual mean reversion. In fact Byers, Davidson and 
Peel (1997, 2000) found a remarkable similarity between the estimated 
d for different countries; they were all around 0.7 and the error term 
was, in most cases, white noise. An estimate of d that lies between 0.5 
and unity carries the implication that the series is non-stationary but 
ultimately mean-reverting. The finding of such a process has important 
implications for trying to ‘explain’ popularity by conventional regres¬ 
sion analyses using economic variables. It has been shown (Tsay and 
Chung, 1996; Marmol, 1998) that the ‘spurious regression’ phenomenon 
best known in connection with unit root processes extends to fractional 
processes. Variables such as inflation and interest rates have also been 
shown to be fractional, if not unit root processes, see for example Baillie, 
Chung and Tieslau (1996). Therefore, simple regression t values may be 
misleading. 

Byers, Davidson and Peel (1997) explained the fractional result in 
the following manner. They hypothesised for simplicity that voters fall 
into two stylized categories: the committed and the uncommitted. The 
committed individuals are those with strong party allegiances. They are 
motivated by conviction or group solidarity, and their voting intentions 
are generally insensitive to news. They will support their party of choice 
through good times and bad. The uncommitted individuals, in contrast, 
who are usually called ‘floating voters’, tend to award their votes on 
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the basis of performance. Newspaper headlines can sway their voting 
intentions considerably, although, by the same token, the effect of news 
is typically transitory and will tend to average out in the long run. The 
current voting intentions of the floating voters are on the whole a poorer 
predictor of future voting intentions than those of the committed voters. 
These features of the voting population are captured in the model by 
assuming that the logarithm of the odds in favour of an individual ex¬ 
pressing an intention to vote for a particular party evolves as a first-order 
AR process, with a parameter that takes a value close to 1 for commit¬ 
ted voters, and substantially below 1 for the floating voters. Consider a 
population of voters, each characterized by a particular value of the AR 
coefficient on the interval [0, 1]. If the distribution of these coefficients 
is as they hypothesize, a result due to Granger (1980) implies that the 
average of a large sample of individual voting intentions (after a logistic 
transformation) should behave similarly to a fractionally integrated pro¬ 
cess. This means that it should exhibit long memory, unlike a stationary 
AR.MA process, in spite of being eventually mean-reverting. 

The Byers, Davidson and Peel explanation of the fractional prop¬ 
erty of popularity data is not based on a formal theoretical model with 
explicit utility underpinnings. It is suggestive that a model that can 
capture the fractional property of popularity data may need to acknowl¬ 
edge differing degrees of ‘habit’ amongst heterogeneous voters. In their 
interpretation it is news about economic variables that drives popularity 
changes, not their actual values. They report empirical evidence consis¬ 
tent with this interpretation. Their empirical findings may be consistent 
with alternative underpinnings but at the least are a clrallange to the 
‘traditional’ models explaining popularity. The fractional property of 
voting responses implies they are stable and predictable in the long run 
but respond to news in the short run. 

One extraordinary regularity survives in all models (certainly for the 
UK): the mid-term swing of polled opinion away from the governing 
party, and back again as the election approaches. This quasi-seasonal 
effect is routinely incorporated into voting functions, but no explanation 
for it exists. The usual rationalization is in terms of a ‘costless protest’: 
the electorate use polls (and by-elections) to signal their preferences for 
modifications of government policy. This does not amount to a desire to 
change the government, so that as the election approaches, true prefer¬ 
ences re-emerge. 
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EXPLAINING PARTY POLICY 

We would like to explain differing party policy within a rational expec¬ 
tations framework. According to Downs’ median voter theorem noted 
above, party policies should be the same: however, a party is formed 
by activists with strong views on policy and it is faced with a choice of 
a likely win by giving these views up or a less likely win while keeping 
the chance of implementing them now or later when the public may have 
been persuaded. We can think of this as maximizing expected utility over 
the future where the utility from implementing policies rises as they get 
closer to the party’s principles but the short-term chances of winning 
(and therefore of implementing them) rise as the platform gets closer 
to the centre. Hence each party will choose some compromise electoral 
manifesto but measurably to the ‘left’ or ‘right’ of the median voter. 

We can tie these ideas in with another strand of the public choice 
literature: this considers the power of vested interests, distinguished by 
their high motivation and efficient organization. Olson (1965, 1982) ar¬ 
gues that they are more effective than ordinary voters, including median 
or floating ones, whose interests are less intensely affected by general 
tax or expenditure changes; hence ordinary voters devote less attention 
(costly information gathering and assessment) to the issues and vested 
interests prevail over policy. Such vested interests will attempt to in¬ 
fluence all parties but because of their ‘class interest’ will have most 
influence with one party typically. Such influence tends to reinforce the 
tendency for party policies to differ. 

We assume that the authorities are faced with an economy in which 
there is no long-run trade-off between inflation and output, but where, 
in the short run, fiscal and monetary policy can stabilize the economy 
by appropriate choice of feedback rules; these are assumed to be effec¬ 
tive on the grounds of, for example, contracts, as discussed in chapter 4. 
The absence of a long-run trade-off does not, however, avoid a choice of 
the long-run budget deficit and, of course, the implied monetary growth 
rate. It might seem that all parties would have as their long-run target 
zero inflation, and hence choose targets for the budget deficit and money 
supply growth to go with this. This is clearly not the case, however, once 
we recognize that a budget deficit with inflation implies a different inci¬ 
dence of the existing tax burden from one with zero inflation, since an 
unanticipated shift to high inflation on the accession of a new govern¬ 
ment will lower the capital value of nominal government debt. This will 
expropriate debt holders to the advantage of the general taxpayer, who 
now pays less tax. 

We assume for formal purposes, very simply, that each party maxi- 
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mizes the expected value at the next election date of a weighted average 
of the utility of its own supporters and that of the floating voter. The 
expectation is formed at time t = 0, the time of strategy choice, and 
it is supposed that this choice occurs only once in each period between 
elections and then cannot effectively be changed. We suppose that a 
party has had its ‘honeymoon’ period, has had to react to the pressures 
of office and after about half a year of its term has settled down and 
then chooses its strategy. The other party has by this time settled in 
opposition and also chooses its strategy. Once chosen the parties cannot 
with credibility change them. 

Formally then, for example, the Conservative party maximizes: 

^E 0 Ct + (1 — h)EqVt (24) 

where /i is the weight given to its own supporters. The function will 
be expected to be maximized at time T, the time of the election. In 
principle it ought to be expected utility from this date onwards, with a 
suitable discount factor, but, for empirical purposes this is considered 
needlessly complicated, given that we have ruled out expected future 
changes in policy programmes. 

Equation (24) is maximized subject to the voters’ preferences and 
their model of the economy. The formal mathematics of this is some¬ 
what complicated (see Minford and Peel, 1982a). However, the implica¬ 
tion of the analysis is important. This is that different political parties, 
who represent different class interests, will pursue different policies. In 
particular party policies will differ significantly, not only in budget and 
money supply targets but also in feedback coefficients according to the 
interaction between voter preferences and the model structure. Labour 
reaction functions will, relative to Conservative reaction functions, em¬ 
body a higher steady state budget deficit and be more responsive to real 
rather than nominal shocks. 

Quite clearly the precise form of reaction function will be dependent 
on the true model of the economy and the nature of voter preferences. 
However, the point of our study remains valid, namely that in an econ¬ 
omy with rational expectations on the part of both government and 
voters, there is scope for systematic policy differences between different 
political parties. 

Empirical results on UK reaction functions for the period 1959-75 
support the model in that significant differences between the political 
parties were discovered. Harte et al. (1983) also confirmed the fruit¬ 
fulness of the approach, and found significant statistical differences in 
reaction functions between the parties in the UK, Sweden and West 
Germany. Differences in reaction functions with respect to political vari- 
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ables have also been found in a number of other countries (e.g. Alesina, 
Roubini and Cohen, 1997). 

These results conflict with the median voter theorem, which predicts 
the same policies for each party. This theorem is therefore rejected by 
the data for first-past-the-post democracies, for which it was constructed. 
For proportional representation systems, the theorem would apply to the 
parliamentary party holding the balance of power; for these democracies, 
there is stronger evidence of party policy convergence, with the differ¬ 
ences in Sweden and Germany appearing less marked than in the UK 
and USA. 

Although there are party differences in policy, nevertheless the me¬ 
dian voter (or party) theorem embodies an important principle: of policy 
convergence towards the centre. This principle has inspired a number 
of studies investigating how far the interests of this median or ‘swing’ 
group of voters (or their party) influence particular, as opposed to gen¬ 
eral economic, policies. Are particular taxes designed to shift the tax 
burden away from these voters? Are government expenditures fixed to 
benefit them? Models which answer ‘yes’ include that of Meltzer and 
Richard (1981, 1983), who also find evidence for these forces. 


RELATED ISSUES AND CURRENT RESEARCH 

Political economy, the topic of this chapter, is a burgeoning area of 
research, for two reasons. First, the government is an agent with objec¬ 
tives, whose actions are of obvious importance: modelling them should 
be superior to treating policy as exogenous or a fixed feedback rule. Sec¬ 
ondly, analytical and computational tools have improved to the point 
where it is feasible to compute the equilibria in games between govern¬ 
ments and the public or other governments. 

Much of this research has already been discussed. But two topics 
deserve emphasis. 

In chapter 5, we discussed the issue of time-inconsistency and the 
incentives both to cheat on promises if believed and, once people realize 
this incentive, to find a mechanism which ensures promises are carried 
out. Models of reputation under imperfect information are attempts to 
explain variations in credibility between total public gullibility on the 
one hand and total cynicism on the other; such models can produce a 
political business cycle, as we have seen. 

They may also produce strategies to bind successor governments into 
policies which benefit the current government. A high inherited level 
of public debt, for example, may restrain a future socialist government 
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(when it comes to re-optimize) from increasing public expenditure; a 
conservative government might then push tax cuts further than otherwise 
in order to reduce tax rates and public spending in the long term, should 
it lose power. Models supporting such strategies are to be found in 
Alesina and Tabellini (1989, 1990) and Persson and Svensson (1987). 

The question of how policy promises can be enforced, by voting be¬ 
haviour or by other parties or by constitutional structures, is also im¬ 
portant, since good policies typically involve making promises which are 
then kept. Rogoff and Sibert (1988) consider voters punishing the gov¬ 
ernment, although this involves the difficulty of why individual voters 
should bother, given their lack of power to influence anything individu¬ 
ally (as in the voting paradox above). Alesina (1987) considers strategies 
where the other party deters promise-breaking in an inter-party pact (‘bi¬ 
partisan policy’): this may not apply well to controversial policy areas, 
such as public spending and inflation. Tabellini (1987) considers the role 
of independent, overlapping governors in an independent central bank. 
A good survey of all these issues is to be found in Alesina (1989). 

The second topic concerns the source of inflation itself. Much ex¬ 
isting literature assumes inflation is motivated by governments needing 
inflation revenue or seigniorage to finance exogenous public spending, 
with the extent of the inflation tax being determined by an optimizing 
choice across tax patterns, to minimize the welfare and collection costs 
of taxation. Yet this approach is at variance with the considerations of 
political interest suggested in this chapter. Instead, inflation may serve 
the interests of the dominant voter and the party that represents him — 
as explored in Minford and Peel (1982a) and Minford (1988). On this 
view, inflation and its associated high interest rates, high inflation vari¬ 
ance and expansionary public spending-cum-deficit programmes, is not 
an accident or the result of Keynesianism, but a deliberate strategy. The 
elimination of these policies will occur, not through better understanding 
nor through constitutional devices (which can always be over-ridden by a 
democratic majority) but through the formation of a new dominant voter 
coalition with different interests. This effect on macroeconomic policies 
parallels the effects explored by Meltzer and Richard (1981, 1983) and 
Olson (1965, 1982) on distributional and micro policies. 


CONCLUSIONS 


Recent work in the public choice literature has considered the interac¬ 
tion of the preferences of the government and electors and the behaviour 
of the economy. This early work assumed that expectations are formed 
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adaptively and generated one key conclusion, namely that the authorities 
may deliberately create a business cycle. When expectations are formed 
rationally, the authorities may still be able to generate a political busi¬ 
ness cycle, provided there is imperfect information and uncertainty about 
their objectives; however, the scope is more limited and can disappear 
altogether as information improves. 

Another key result of the public choice literature is the median voter 
theorem, according to which party policies should be essentially the 
same, because they are designed to capture the floating voter. How¬ 
ever, there is more empirical support for an alternative theory in which 
parties attempt to maximize the expected future utility, not only of 
floating voters but also of their own class-based supporters and associ¬ 
ated vested interests. This will give rise to systematic differences in the 
parties’ reaction functions. 

We also considered the way in which government popularity and vot¬ 
ing behaviour has been modelled. We suggested that the conventional 
approach based on past economic indicators was deficient in ignoring 
expectations of future economic variables and the behaviour of the op¬ 
position. However, because of the importance of non-economic factors, 
it is unlikely that functions based solely on economic variables will gen¬ 
erate statistically robust results; the evidence to date confirms this lack 
of robustness. 

Finally, we considered research on how voter coalitions and vested 
interests may influence the strategies of political parties towards a whole 
range of variables — inflation, public spending, public debt and sub¬ 
sidies, to name but a few. Modern analytical and computational tools 
offer us the chance of a better understanding of the causes of government 
strategy. 
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APPENDIX 8A: DIFFERENCES IN ECONOMIC 
POLICIES OF POLITICAL PARTIES AND CENTRAL 
BANK INDEPENDENCE 

We analysed above the implications of a model in which it was assumed 
that different political parties, who represent different class interests, will 
pursue different policies. This assumption may also have implications 
both for the operation of an independent central bank (see chapter 5) and 
the interpretation of the empirical evidence on the impact of independent 
central banks on economic outcomes as pointed out by e.g. Waller (1989) 
and Alesina and Gatti (1995). 

Essentially if economic policies differ between parties then rational 
expectations of future economic variables will reflect electoral uncer¬ 
tainty. For example inflation expectations for a period in which there is 
an election will be given by 

Ep = jEp 1 + (1 - j)Ep c (1) 

where j is the (assumed exogenous) probability of election of party l and 
Ep c and Ep 1 are expected inflation under the two parties. 

To illustrate formally some implications we employ the model of 
Alesina and Gatti (1995) as set out in Alesina, Roubini and Cohen 
(1997). 

They assume that the utility functions of the two parties have a 
standard quadratic form and are given by: 

U l = -0.5p 2 - 0.5b\y - y*) 2 (2) 

and 

U c = -0.5 p 2 - 0.5 b c (y - y*) 2 (3) 

where y *is the target level of output, assumed equal for the two parties 
to make the algebra simpler. Similar considerations apply to the target 
rate of inflation, assumed to be zero for each party b c is assumed to be 
smaller than b l so that the Conservative party gives relatively less weight 
to output stabilization and more to inflation than the Labour party. 
The supply function is given the simple form 

y = p- Ep + u (4) 

where u is a random shock. The natural rate of output is set to zero. 
Thus y is the output gap. 

Assume that expectations are formed over a period that includes an 
election. 
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Employing the same methodology and timing of events as outlined 
above we obtain the first-order conditions for maximising U l . and U c 
given the supply function and Ep as 

— p l — b\p l — Ep + u — y*) = 0 (5) 

and 

— p c — b c {p c — Ep + u — y*) = 0 (6) 


Substitution for Ep from (1) gives two simultaneous equations that can 
be solved for p c and p l respectively as 


and 


b l (l + b r )y* b l u 

1 + b l (l — j) + jb c 1 + b l 

b c (l + b l )y* _ b c u 
1 + b l (l — j) + jb c 1 +b c 


When party l is in power output is given by 

y = p l — Ep + u 


( 7 ) 

( 8 ) 

(9) 


so that output under party l, substituting for p l and Ep from (7) and 
(1) respectively is given by 


« = (1 -j)(b l -b c )y* u 
1 + 6^(1 — j) + jb c 1 +b l 

The same procedure gives the output for party c as 


( 10 ) 


C = - j{b l -b c )y * u 

' l + b l (l- j)+jb c l+b c 
The variance of output, a 2 y , is given by 


( 11 ) 


^ = jE(y l - Eyf + (1 - j)E{y c - Ey) 2 
since Ey = 0 substitution yields the solution for a 2 as 


2 _ J(l-j)(b l -b c ) 2 (y*) 2 
° v (1 + b l (l- j) + jb c ) 2 

where a 2 is the variance of u. 


+ 


+ 


(1 -j) 


(1 + b 1 ) 2 (1 + b c ) 2 


The variance of inflation, a 2 , is given by 

a 2 p =jE(p l -Ep) 2 + (l-j)E(p c 


Ep) 2 


( 12 ) 


(13) 


(14) 
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Substitution from (1), (7) and (8) gives 

2 j(l — j)(b l — b c ) 2 (y*) 2 j{b l f (1 -jW) 2 2 

ap {l + b l {l-j)+jb c )°- l (l + 5') 2 (1 + 6 C ) 2 lu 

Finally expected inflation is given by 

5 C (1 + b l ) + j(b l — b c ) 

1 1 + b l ( 1 — j ) + jb c 


(15) 


(16) 


Equation (13) is the essential insight of Alesina and Gatti. The 
variance of output is decomposed into two parts. The first is the result 
of electoral uncertainty. Naturally it disappears when j = 0 or 1; the 
variance is also increasing in b l —b c , so that the greater the difference in 
policies between the two parties the greater the variance of output. It 
is also interesting that the variance of output is not independent of the 
target level of output as is the case when there is no political uncertainty. 
The second term reflects the impact of the variance of the economic 
shock. 

It is also interesting to note from (10) and (11) that the average value 
of the output gap is no longer zero under the different regimes. The 
conservative regime will experience a negative output gap on average 
whilst the labour regime will experience a positive one. The result of 
Alesina and Gatti is important for interpretation of empirical work. We 
showed above how appointment of a conservative central banker would 
result in a higher variance of output than the first best. However in 
comparison to the outcome under political uncertainty the variance of 
output could be lower, depending upon the various parameters. This 
point is important both as a potential argument for the introduction of an 
independent central bank and when interpreting the empirical evidence 
on the economic effects of central banks. 

Alesina and Gatti (1995) also consider the interesting issue of whether 
both parties could be better off appointing an independent central banker 
with a given b rather than the non-cooperative outcomes defined by 
equations (13), (15) and (16). They suggest that this can be the case. 
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Unemployment, productivity and 
growth 


The modern macroeconomic model implies that the economy is con¬ 
verging on its natural rate at some speed determined by for example 
overlapping contracts or adjustment costs. Therefore the natural rates 
of output and unemployment become of central interest. 

Much of the literature of the ‘supply side’ dwells on productivity and 
growth, but this neglects the important issue of unemployment which has 
been a particular problem in Europe. This has importance beyond the 
narrow issue of the number of people unemployed because of its social 
significance: politicians attach great importance to ‘curing unemploy¬ 
ment’ because of its obvious unpopularity with voters. Unfortunately 
they tend to alight on measures that address the symptoms, not the 
disease; notably ‘work-sharing’, reducing participation (by for example 
early retirement or ‘family policies’ designed to keep women at home), 
reducing working hours, or indeed reducing productivity growth and the 
penetration of new technology. The reason they pick such policies is 
that the original disease, as we shall argue below, is due to their erecting 
‘social’ support mechanisms that raise labour costs; it follows that cures 
based on ‘labour market deregulation’ (i.e. eliminating or bypassing such 
support) have no appeal to them. Instead they put their faith in mea¬ 
sures that they think may mitigate the side-effects, in unemployment, of 
their (desirable) social policies. 

This tendency of high unemployment to be accompanied by such 
policies is illustrated in Table 9.1. This shows at the end of the 1990s 
how low participation and low working hours tended to accompany the 
high unemployment in Germany, France and Italy. In Italy for example 
were the participation rate to be at the US level with no other changes 
unemployment would be around 30%. Table 9.2 following shows some 
evidence that these countries have also experienced a relative slowdown 
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U nemploy ment 

Participation-)- 

Working 


(%) * 

Total (%) 

(55-64 

Hours** 

US 

4.6 

66.4 

57.2 

1943 

Japan 

4.4 

62.6 

66.2 

2014 

UK 

4.6 

63.9 

51.5 

1826 

Germany 

10.6 

55.0 

42.6 

1557 

France 

11.5 

55.6 

36.1 

1612 

Italy 

12.3 

47.2 

28.3 

1790 


Sources 

* (Economist) mid 1998 **1992, Manufacturing 


(US Bur. Lab. Stats, Washington DC) 
+ Total: 1990 (US Bur. Lab. Stats, Washington DC); 

55-64 Yr Olds: 1995 (OECD) 


Table 9.1: Labour Market Performance 


in productivity growth in the 1980s and 1990s from the earlier postwar 
period; this suggests that their productivity growth too may have been 
held back by such policies. 


UNEMPLOYMENT 

Our focus in this section is on the natural rate, not on the cyclical be¬ 
haviour of unemployment. The latter has to be explained in the context 
of the business cycle models of earlier chapters. The natural rate is 
the equilibrium to which these cycles tend. Milton Friedman (1968) re¬ 
marked in his AEA lecture in 1968 that it was the equilibrium ‘ground 
out by the Walrasian system’ of real demands and supplies. However, it 
never really occurred to macroeconomists to model it until much later; 
Friedman, Phelps (1970) and others using the natural rate concept effec¬ 
tively treated it as a natural constant. It was not until the early 1980s 
in the UK where unemployment rose above 10% with no apparent ten¬ 
dency to fall that models began to be formulated of a changing natural 
rate. The first effort was by Minford (1983); he took the classical labour 
supply set-up of earlier chapters and added the idea of a permanent un¬ 
employment benefit, payable without any check on work availability (a 
peculiarly European concept). The result was to tilt the labour supply 
curve so that the real wage offer never fell below the benefit. This had 
the effect of creating the ‘real wage rigidity’ identified for example by 
Bruno and Sachs (1985) in their account of the 1973-4 oil crisis (fig¬ 
ure 9.1). Note too that with such benefits one can account also for the 
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cyclical behaviour of real wages and unemployment; real wages are pro¬ 
cyclical, rising in the upswing and lifting people out of benefit, falling in 
the downswing so that people go on to benefit. 

Hence unemployment tends to breed policies that inhibit participa¬ 
tion and productivity growth. Our discussion therefore begins with un¬ 
employment. It goes on to the optimal size of government. It ends with 
growth itself. 



1960-1973 

% 


1979-1994 

% 

1 

Japan 

5.5 

1 

Ireland 

2.6 

2 

Portugal 

5.4 

2 

Finland 

2.5 

3 

Ireland 

4.6 

3 

Spain 

1.7 

4 

Italy 

4.4 

4 

Portugal 

1.6 

5 

Finland 

4.0 

5 

UK 

1.5 

6 

Belgium 

3.8 

6 

Denmark 

1.3 

7 

France 

3.7 

7 

France 

1.3 

8 

Netherlands 

3.4 

8 

Belgium 

1.2 

9 

Spain 

3.2 

9 

Japan 

1.1 

10 

Austria 

3.1 

10 

Netherlands 

1.1 

11 

Germany 

2.6 

11 

Sweden 

1.0 

12 

UK 

2.6 

12 

Austria 

0.9 

13 

Greece 

2.5 

13 

Italy 

0.9 

14 

USA 

2.5 

14 

Australia 

0.8 

15 

Denmark 

2.3 

15 

USA 

0.5 

16 

Australia 

2.2 

16 

Germany 

0.4 

17 

Switzerland 

2.1 

17 

Canada 

-0.1 

18 

Norway 

2.0 

18 

Norway 

-0.1 

19 

Sweden 

2.0 

19 

Switzerland 

-0.2 

20 

Canada 

1.9 

20 

Greece 

-0.3 


Source: OECD (1996) as cited in Crafts (1997) 


Table 9.2: Total Factor Productivity (TFP) Growth in the Business Sector (% 
per annum) 

In the figure one can see how the normal marginal product of labour 
schedule can interact with this distorted labour supply schedule to gen¬ 
erate equilibrium unemployment. Should the benefit rise relative to 
productivity, unemployment will result. That is, people will voluntar¬ 
ily refuse to take available wage offers because benefits are preferable. 
They are ‘unemployed’ in the sense that they are not working but are 
‘available for work’: thus in response to the usual survey questions they 
would be counted as wanting work (if at the ‘right wage’ but this is not 
generally included in the assessment) and some governments also would 
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count them as unemployed because they are in receipt of unemployment 
benefit. In any case the unemployment is recognizable as what causes 
social dissatisfaction. 

The labour market model can be generalised to include the effects 
of union power, taxes of all sorts, and employer and employee national 
insurance contributions (which in Europe are largely taxes in nature). 
When placed within the general equilibrium of an open economy one 
obtains natural rates of output, real wages and the real exchange rate 
as well as employment and unemployment (see chapter 10). Later ver¬ 
sions have proliferated; in the UK Layard and Nickell (1985) estimated 
a similar model, and Bean et al (1986) attempted to extend it to other 
European countries which began to experience rising unemployment UK- 
style during the late 1980s and 1990s. It turns out that in each country 
there are substantial idiosyncracies in the social support mechanisms, 
complicating effective modelling of the natural unemployment rate. Nev¬ 
ertheless a large amount of empirical work, both cross-section (Burda, 
1988, was the first to exploit the variation across European countries 
and show the importance of long-duration benefits) and time-series evi¬ 
dence (Layard, Nickell and Jackman, 1991, survey much of it) seemed to 
confirm that these mechanisms, particularly the length of time benefits 
were available and their ease of eligibility, were responsible for persis¬ 
tently high unemployment in Europe. By the end of the 1990s a general 
consensus had appeared, embodied in the OECD secretariat, that ‘labour 
market flexibility’ was the key to reducing equilibrium unemployment. 



Figure 9.1: The natural rate of unemployment 
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Much of the traditional literature on unemployment emphasises sea¬ 
rch behaviour (e.g. Lancaster, 1979; Nickell, 1979). In the absence of a 
permanent unemployment benefit such behaviour would make sense; we 
could model a steady flow of job separations, with people searching for 
some average time determined in the usual search-optimising manner. 
This would give rise to an unemployment equilibrium of the rate of flow 
times the length of search; e.g. if 20% of the workforce separate each year 
and spend three months searching, this would yield an unemployment 
rate of 5% (0.2x0.25=0.05). We can think of this as a ‘frictional’ rate of 
unemployment; plainly in a well-functioning economy the natural rate 
should be such a frictional rate. The very high and long-lasting levels of 
unemployment seen in Europe during the late 1980s and early 1990s are 
not well explained in these terms, however; these high natural rates are 
better explained in terms of the model above, in which the long-term 
unemployed cannot be said in any meaningful way to be ‘searching’. 

Thus a first set of policies to generate high activity should be those 
of labour market flexibility. 


THE OPTIMAL ROLE OF GOVERNMENT 

It is plain that government provides some useful services. These services 
(such as law and order and infrastructure) could be provided privately 
but it is more efficient in practice to provide them publicly; that is, for 
‘public goods’ there is a direct saving of resources from eliminating the 
duplication, the transactions costs and the under-use from private pro¬ 
vision. However, there is also a cost in public provision: that distorting 
taxes must be raised to pay for the service. Though lump-sum taxes 
without a distorting effect are possible, they are so unpopular that in 
practice governments do not raise them to any serious extent (when the 
UK government brought in the ‘poll tax’ in the late 1980s to replace the 
‘rate’, a tax on property values, it contributed to the fall of Margaret 
Thatcher; subsequently the tax was withdrawn in favour of a banded 
property tax). 

We can model these two sides of public spending in terms of the 
labour market and the production function: public spending raises pro¬ 
ductivity but causes a distortion in labour supply — figure 9.2. A helpful 
way of summarising the twin effects as government spending (G) rises as 
a fraction of GDP stems from the Laffer Curve (figure 9.3) which shows 
tax revenue as a function of the tax rate (tax revenue = public spend¬ 
ing). At low levels of spending, the tax rate is low and the marginal 
distortion cost of taxation (which rises with the square of the tax rate 
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according to the standard consumer surplus formula) is correspondingly 
low, while the marginal benefit of government spending is high. With 
efficiency raised by the spending and low tax-distorting inefficiency, the 
revenue yield relative to the tax rate is high. As spending and the tax 
rate rises, this relative yield falls, as the marginal benefit of the spending 
falls and the marginal distorting cost rises. The optimal tax rate and size 
of government is given by To; as spending rises above this point, we move 
towards the revenue-maximising tax rate T max where any further rise in 
the tax rate yields no extra revenue and so permits no extra spending. 
Thus whatever its motives no government can rationally operate to the 
right of this point. 



Figure 9.2: Public spending distorts the labour supply 

This, useful as it is conceptually, tells us nothing in practice about 
where the optimal tax rate is. If we neglect very poor countries in Africa 
and elsewhere with poor infrastructure, there seem to be three main 
groups: Asian emerging-market countries with low tax rates (around 
20%), good basic infrastructure but limited provision of welfare services 
and social insurance like unemployment benefit and public health care; 
an Anglo-Saxon group with medium tax rates (35-40%) and fairly ex¬ 
tensive welfare services/social insurance; and a European group with 
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Figure 9.3: The Laffer curve 

high tax rates (around 50%) and very extensive social insurance. The 
essential problem with the latter group is, as we saw in the last section, 
that generous social insurance distorts labour supply. Furthermore, the 
high marginal tax rates implied have substantial effects on work incen¬ 
tives for taxpayers on top rates at least; evidence from the US (Lindsey, 
1987a, b; Feldstein, 1995) and the UK (Minford and Ashton, 1991) sug¬ 
gest that high earners’ hours respond strongly to rising marginal rates 
so that higher-band tax revenues are likely to fall, putting them on the 
wrong side of the Laffer Curve (this is without including the effects of 
tax avoidance and evasion, and of migration or ‘brain drain’). It is true 
that a degree of social insurance may make workers more willing to be 
flexible in job choice and location (for example the combination of no 
unemployment benefit and strong unions, as in Italy, may make it ex¬ 
tremely difficult to close plants.) Nevertheless in a rich society most 
people would be willing to pay for higher than basic levels of health in- 
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surance, pensions and education; if the state provides these basic levels 
but no more, there is a basis for cutting tax rates to somewhere between 
the Anglo-Saxon and the Asian rates. Such a move has proved to be 
popular in the UK with pensions for example. If acceptable politically, 
it enables the economy to have a less distorting tax system with the 
reduction in government provision offset by higher private provision. 


Government spending as % of GDP 


Anglo Saxon / Oriental 

USA 

36.7* 

Japan 

36.7* 

UK 

43.5** 

Continental European 

Germany 

51.0** 

France 

54.3* 

Italy 

* 1995 ** 1996 

53.2** 


Table 9.3: Public Expenditure Shares in GDP 


GROWTH 

Exogenous growth 

Let us start by discussing a world with exogenous growth, the original 
standard framework of growth theory in which the size of the labour 
force and the progress of technology are both given by forces beyond 
the control of households or their governments (which respond to their 
wishes in some sense, let us assume). 

Consider first the whole world economy. Represent its production 
possibilities in the simplest way as Cobb-Douglas: 

Vt = A t L?Kl~ a ( 1 ) 

For simplicity we will set depreciation to zero. Now, recalling that 
in equilibrium desired savings equal desired investment, we can follow 
Harrod and Dornar to obtain the basic equation of growth equilibrium: 


S = s(r)Y = 1 = k(r)AY = k(r)gY 


(2) 
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where k is the capital-output ratio, y 1 ; or 


k(r ) 


= 9 


(3) 


We can show that growth, g, is exogenously given by technological 
progress and the growth of the labour force; hence (3) will solve for the 
real interest rate, r: 


Kt 

rt = { 1 — a)—; that is, k{r) 
Yt 


n 


1 — a 


( 4 ) 


Wt = a^r (5) 

are the first-order conditions. Substituting for endogenous capital in the 
production function gives us: 

" <6) 


Assuming that there is a steady state with a constant real interest 
rate and growth rate, then taking log first differences of (6) yields the 
growth rate as: 


1 

9 = ~9a + 9l 
a 


( 7 ) 


where = growth of A (L). 

From (4) this is the growth rate also of capital. With the real interest 
rate given by (3), (6) yields the level of output, (4) the level of capital, 
and (5) the real wage consistent with full employment of the labour force, 
L. 


If we now consider an open economy facing a world capital market 
with given world real interest rate, then everything is the same except 
that now with r t exogenous (3) is replaced by: 

s(r) 


k(r) + ^ 


= 9 


( 8 ) 


which serves to determine NFI, net foreign investment (alias the current 
account surplus). 

We have assumed implicitly that each country has its own technol¬ 
ogy. But we could add a mechanism of ‘technological catch-up’ whereby 
technological knowledge flows from high- to low-technology countries 
(via a variety of possible channels — e.g. licencing, corporate take-over, 
or multi-national company investment in low-wage countries using their 
own technology to exploit low wage costs). This catch-up would produce 
faster growth of A until it had caught up with best practice internation¬ 
ally. This ‘convergence’ is the focus of empirical work by Barro and 
Sala-i-Martin (for example, 1998) — note the caveats of Quah (1993). 
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Endogenous growth 

One can investigate more complicated functional forms and also do em¬ 
pirical work on the ‘sources of growth’ (e.g. Denison, 1974) but essen¬ 
tially this was the long and the short of ‘growth theory’ under these exo¬ 
geneity assumptions. It is against this background that one can consider 
‘endogenous growth’. The assumption in this case is that the growth rate 
is a matter of choice (whether by individual households or their govern¬ 
ments). In some sense this seems quite right. There are huge differences 
in growth rates, both between countries at the same time and within 
the same country at different periods of history (Parente and Prescott, 
1999, provide a clear overview of the ‘stylised facts’ of growth). These 
differences seem to be the result of choices within these countries. 

Of course two questions arise. What proximate mechanism produces 
growth? And how do people formulate their choices (that in turn feed 
into the mechanism) in the light of it? These two questions are closely 
intertwined. 

On the first question of mechanisms there is now a huge literature 
theorising on possible ones. One can perhaps single out three as front 
runners — though it must be stressed that empirical testing has lagged 
way behind the theory (not least because of the great difficulties of iden¬ 
tification in this area where so many different mechanisms are at work, 
all of them related in both causal directions with growth) so that we are 
still largely in our armchairs on this issue. 

The first is increasing returns to scale over sections of the produc¬ 
tion function — P. Roiner (1986). Suppose one accepts that in nature 
at some comprehensive level of description there are constant returns 
essentially on logical grounds: if you double every single ingredient that 
is producing something then in principle you should double output since 
all you are doing is replication — putting the same thing side by side 
with itself must give you double. Nevertheless in practice not everything 
is included in the production function, there are always uncosted fac¬ 
tors (‘commons’, resources that are uncharged for because they are not 
scarce in the given situation) and also as size changes so does the nature 
of the operation. For example in ‘virgin territories’ land is free; then the 
increasing penetration of people can reap increasing returns as thresh¬ 
olds of exploitability are passed. A similar thing appears to happen with 
all new technologies (Mansfield, 1968); they follow an S-shaped curve of 
productivity. In the early stages productivity grows slowly because the 
technique is poorly implementable with little learning and few users; as 
more users join and learning increases, productivity rises rapidly; finally 
as all its uses are exploited productivity growth tails off. 
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These ideas underpin specifications of the production function with 
increasing returns — most easily represented by making the technology 
parameter, A t , depend on size of output. Assuming that the stock of 
labour is exogenous, then the increasing returns can be exploited by 
increasing the capital stock. 

A second main route is to model A t as accumulated knowledge, with 
knowledge production an industry in its own right (the R&D sector). 

The third, closely related, route is to include human capital (skills 
of the workforce as opposed to pure technological knowledge) in the 
production function (Lucas, 1988). One can then treat it as investment 
requiring, like physical capital, savings to create it (though in this case 
not recorded financial savings but rather the substitution of ‘creative’ for 
non-creative leisure), in which case the model’s behaviour is essentially 
like that of the Solow model; but because it downgrades the contribution 
of ‘pure labour’ in favour of human capital it implies that saving has 
a bigger effect on growth and so can account more easily for growth 
differences between countries (D. Romer, 1996, section 3.11). Or we can 
attribute human capital to learning-by-doing, so that it increases with 
the level of output say (more precisely with the accumulation of output 
experience). As with increasing returns one can think of this as a way 
by which size raises the technology parameter, A t . 

Having discussed mechanisms, we turn to the second, crucial question 
of how choices are formulated: if people could predict exactly how their 
and other people’s investments would interact and could coordinate these 
choices, then one could envisage a country’s (a social planner’s) strategy 
to exploit (‘internalise’) these increasing returns. If on the other hand 
choices are uncoordinated and each regards the investments of others 
as given and unpredictable, then the increasing returns are ‘external’ to 
individual choice; people will preserve the same first-order condition as 
above but increasing returns will come from them happening to invest 
together for reasons of say perceived improvement in technology. It is 
more normal (and clearly more realistic) to make this latter assumption 
that people act singly so that these effects, through either mechanism, 
are external. We make this assumption initially in order to illustrate the 
workings of the typical endogenous growth model. 

In what follows we shall formally use the open economy assumption 
and treat the real interest rate as exogenously set in world markets. (One 
can also think of the interest rate as reacting in a closed economy, the 
world say, to clear the market for savings and investment; in this case 
will have a varying steady state value, provided there is a steady state, 
as growth varies.) This departs from the frequently-met assumption of 
a fixed savings rate; but it is both more realistic and simpler to handle 



Unemployment, Productivity and Growth 


243 


and in any case the essential properties of the models are unaffected. 

We give two examples, using the model as already set up but adding 
an explicit equation for A t . Our second will illustrate the knowledge- 
production model. The first illustrates the increasing returns and the 
learning-by-doing human capital models: we represent both mechanisms 
by the simple device of making technology a function of size 

A t (Y t )= ex p* t Y?_ 1 (9) 

where n and 7 are positive constants. The lag is there to bring out some 
simple dynamics. If we now substitute into the production function both 
this and the optimal capital stock from (4) we obtain: 

1-c* 

Y t = (exp -t)Y?J?L t ( —! —\ (10) 

a \1 — a / 

implying that: 

fft = -b — fft-i + ffL (11) 

a a 

If 7 > a, then this will be an explosive difference equation and growth 
explodes correspondingly; no steady state exists until presumably at 
some point 7 drops to below a. If and when 7 -7 a, then the model 
converges on steady state growth which is: 

9 = 7 —: ( + 9L ) ( 12 ) 

a — 7 \a J 

and the corresponding output is: 

i i3 > 

What we see here is two exciting results. First, that growth can ‘take 
off’ in an explosive burst, feeding on itself. Second, that even when it 
is not (or no longer) explosive, growth will settle down to a ‘multiplier’ 
on the exogenous sources of growth. From a policy viewpoint any gov¬ 
ernment action that could promote such exogenous sources of growth 
would be prima facie desirable, since the private consumption cost would 
presumably be exceeded by the ‘multiplied’ effect of the action (and cer¬ 
tainly would be far exceeded if explosive growth is triggered). For ex¬ 
ample suppose government-financed training increased human capital, 

inserting a multiplicative term h t into (9). Then the level of output 
1 

would rise by h ; so provided the tax cost is less than this it would 
be desirable, abstracting from distributional issues. Similar arguments 
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could be applied to public infrastructure spending that contributed di¬ 
rectly to the profitability of investment (so raising A t ). Examples could 
be extended easily to many aspects of public spending — e.g. on health, 
policing, social care. Via their contributions to human capital or the 
productivity of private capital they could generate large multiplied ben¬ 
efits. This discussion follows familiar long-standing discussions of public 
spending whenever an ‘externality’ is identified. The difficulties with the 
line of argument are also familiarly those of identifying the benefits with 
any degree of certainty. Here we have only the haziest idea even of the 
functional forms that might embody these externalities let alone the size 
of the parameters; empirical work has barely begun to generate plausible 
estimates. 

The second example we consider is that of knowledge accumulation 
via an R&D industry; it has essentially the same sort of implications. 
We follow D. Romer’s (1996) simplification of models due to P. Romer 
(1990), Grossman and Helpman (1991), and Aghion and Howitt (1992). 
Now let A t be produced by some fixed fraction, a l, of the labour force 
(we ignore capital for simplicity) according to 

Q -^=a L L t A\ (14) 

Substituting capital demand into our production function gives us: 


Y t = A] /a ( 1 - a L )L t 


n 


a 


1 /a 


(15) 


(where Y t is the output available for consumption and investment, ex¬ 
cluding R&D) from which: 


1 

9 = 9 a + gL 

a 


(16) 


as in the Solow case. The difference is now that g A is endogenous. 

(14) is a first-order differential equation in A t which is explosive if 
0 > 1. Hence in this example as in the first we may have explosive bursts 
of growth until 9 falls below 1. Once 9 1, we have: 


dA t 

9a, i = -g- = (17) 


so that 


dkig A ,t 


9t 


gL + (9 — 1 )gA,t 


(18) 
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In the steady state where ailI g^' 4 ’ f = 0, this implies 


9 a = 


9l 

1-0 


(19) 


so that steady state growth of output is as before a multliplier of exoge¬ 
nous growth factors: 


9 = 


1 + 


a{ 1 - 9) 


9l 


( 20 ) 


The policy implications are analogous with respect to intervention 
in R&D. If somehow resources can be diverted to this industry, by a 
rise in ax, for example, then the immediate loss in output is soon more 
than made up for by the rise in A t , possibly an explosive one for a 
time. Nevertheless, such seductive implications must be qualified by 
our empirical ignorance of the parameters as well as of the mechanisms 
generating effective R&D. 

These two examples must suffice to illustrate the proliferation of mod¬ 
els with features of this type. The general implication of these models 
is that with uncoordinated private agents there are potentially massive 
externalities in activities generating ‘growth agents’ such as knowledge, 
human capital, and agglomeration and other sources of increasing re¬ 
turns. 

Suppose instead that private choices are coordinated in some way. 
This could happen in different ways. Government coordination is one; 
as it involves detailed knowledge of the potential gains from new tech¬ 
nological applications and investments, this is not a main candidate ex¬ 
cept for rather basic elements of a joint strategy such as infrastructure. 
Plainly however in any coordination the government is likely to have 
some role to play, if only in agreeing to get out of the way (e.g. in 
monopoly regulation). Probably the main way in which coordination 
might occur is intra-industry joint ventures; there are many examples 
of such collaboration between competitors (such as in the airline indus¬ 
try to develop internet booking systems or in the telecommunications 
industry to develop the new generation of mobile phones and handheld 
computers). One should also not discount popular coordination, now 
more achievable via the internet. 

Assuming such coordination, one can think of a representative house¬ 
hold (like a social planner) optimising its intertemporal utility by choos¬ 
ing a particular growth strategy given all these sources of self-reinforcing 
growth. This clearly produces a highly complex private optimising prob¬ 
lem in which all the opportunities and constraints are internalised; of 
course if the economy could offer explosive growth, this will imply a mas¬ 
sive free lunch and a coordinated jump to a hugely richer world where 
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convergence once more has returned and normal maximization can be 
resumed. The role for government in this set-up is different from the ex¬ 
ternality case just considered. Here government has no business spending 
any resources since there are no (‘external’) opportunities not already 
exploited by the private sector/government coordination. In these cir¬ 
cumstances government regulations, taxes and other interventions would 
inhibit the private sector from exploiting available opportunities. In 
crude terms, where there are large incentives to exploit potential new 
technologies, the private sector will take larger risks and invest more 
resources than where taxes are high and regulations are stringent. In¬ 
terestingly, Parente and Prescott concluded that some such x-factor of 
the degree of non-intervention promised the best hope of explaining the 
stylised facts of growth. 

This emphasises the importance of social institutions and policy 
frameworks, within which in general households take decentralised deci¬ 
sions in their own private interests and yet may have an effective way of 
internalising some externalities. The idea that somehow societies auto¬ 
matically internalise all externalities flies in the face of the obvious evi¬ 
dence of huge differences in the success of different societies in achieving 
growth. There is plenty of evidence that institutions evolve over time 
instead of being the direct object of social choice (Sugden, 1986). They 
are the result either of unprompted social interaction or of some political 
process. In the last section of this chapter we discuss what light ‘political 
economy’ can shed on success and failure in producing institutions that 
are good for the supply side of the economy. 


THE POLITICAL ECONOMY OF THE SUPPLY 
SIDE 

There is a massive literature on the creation and evolution of the institu¬ 
tions that favour or inhibit capitalist growth. North (1981) charted the 
way in which protestant dissent in the low countries and the UK pro¬ 
duced the first industrial revolutions; Lai (1998) has gone further back to 
show how competition in Europe between nation states under the edicts 
of Papal Christendom gave capitalism its secure basis. In two important 
books, the late Mancur Olson (1965 and 1982) set out the mechanism by 
which vested interest groups could prevent the general good (in the sec¬ 
ond he argued that as nations become older they acquire more powerful 
vested interests as networks and clubs have longer to form and become 
entrenched); essentially they can exercise discipline over their members 
who have strong interests at stake, whereas the general public have too 
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little incentive individually to understand how their own interests are 
prejudiced by the action of these groups. Hence for politicians to mo¬ 
bilise opinion in favour of reform is costly and uncertain; whereas these 
groups can offer them rewards, both personal and political, for push¬ 
ing forward their own agendas — an activity known as ‘rent-seeking’, 
in which existing rents are diverted instead of being augmented by pro¬ 
ductive action. This basic idea has led to a substantial applied research 
agenda (e.g. St. Paul, 1996, on the difficulties of modifying costly firing 
regulations in Europe, and Tullock et al., 2000, for a survey of US work). 

However, there are examples of supply-side reforms being undertaken 
in spite of vested interest opposition. Three such are the wide-ranging 
reforms of the Thatcher conservatives over the 1980s and 1990s in the 
UK, and in the US the Carter deregulation of the 1970s and the Reagan 
tax reforms of the 1980s. On these occasions it proved possible for 
politicians to build a sufficient coalition in public opinion to support 
reform. 

So there is a tension between the strengths of vested interests and 
the power of public opinion in asserting its general interests. A political 
economy of institutions should attempt to model this tension. In chapter 
8 we already reviewed the political economy of macroeconomic policy; 
central to this were models of voting behaviour. It is natural to extend 
these to supply-side issues — which we can define as microeconomic 
issues with macroeconomic consequences. 

Many detailed approaches are possible. Here we illustrate them by 
taking one possible model, that of the floating voter lying between ‘capi¬ 
talists’ and ‘workers’ as in the model of Minford and Peel (1983) reviewed 
in chapter 8. The workers, it will be remembered, obtained their income 
stream from human capital, whose value was badly affected by unem¬ 
ployment but largely hedged against inflation; the capitalists obtained 
theirs from financial and physical capital which was largely unaffected 
by unemployment but vulnerable to inflation. We can see similar di¬ 
chotomies of interests between these two groups in respect of supply-side 
issues; unemployment benefits will appeal to workers but be disliked by 
capitalists who will pay much of the bill in return for no reward, redis¬ 
tributive taxation will be the same, as will be such things as minimum 
wages, workers’ rights and firing restrictions. Let us assume that the ef¬ 
fects of an improvement in the general good of the economy is too small 
in normal circumstances to affect the voting behaviour of each group 
relative to its sectional (rent-seeking) interest; thus each group normally 
votes its selfish pocket. We can imagine a status quo in which the vested 
interests of each group of voters are represented respectively by unions 
and employers’ associations; the floating voter is some weighted com- 
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bination of the two groups, which therefore votes a weighted average 
pocket. The question then is how shocks to the economy may alter the 
political equilibrium represented by this floating voter. 

As hints towards the outcome, we may reflect on how — as noted at 
the start of this chapter — downward shifts in general economic pros¬ 
perity have triggered increased intervention (more benefits, taxation and 
regulation). The Great Depression in the US famously unleashed both 
the Roosevelt New Deal interventions and massive protection. During 
the 1980s in continental Europe the rise of unemployment brought in¬ 
creased regulation of the labour market — for example reduced working 
hours — that reduced participation as well. At the same time we may 
also note that crises and very poor economic performance can trigger re¬ 
form because voters suspend their normal voting patterns, so obviously 
atrocious has the general state of the economy become that it pays them 
more to restore its health than to gain a rent-seeking interest. 

This sort of voting behaviour might suggest a model for change in 
the political equilibrium according to some indicator of general economic 
performance, say unemployment. We could assume that at very low 
rates of unemployment (good performance) the floating voters are pre¬ 
dominantly capitalist, with little concern for unemployment (because 
prosperity has enhanced holdings of non-human capital and reduced the 
risk to human capital); at high but not catastrophic rates they are pre¬ 
dominantly workers, with high concern (the risk to human capital has 
risen and holdings of non-human capital have been devalued); and that 
at catastrophic rates they switch from normal voting patterns to be¬ 
come concerned with maximising the general good. Suppose we focus 
on a representative supply-side issue, like the level and duration (overall 
‘generosity’) of real unemployment benefits, B t . This points to a model 
of change in B t being a quadratic in the rate of unemployment, U t . or 
say: 

A In B t = a(Ut~i - c) - 0(U t - 1 ) 2 (21) 

so that initially a rise in unemployment above some critical rate, c, would 
trigger demands for higher benefits; but as unemployment rose, the gen¬ 
eral good element would become more of a restraining factor, until ulti¬ 
mately voters demanded reform and benefits were cut. One could postu¬ 
late similar mechanisms affecting other supply-side policies; for example 
the tax rate would tend to rise as benefit bills rose with intermediate 
unemployment, but be cut once the crisis had hit, while demands for 
regulations would tend to mirror demands for benefits. 

If we combine this with a standard model of (long-term) unemploy¬ 
ment, as set out earlier in this chapter: 

In U t = a In B t + U 0 


( 22 ) 
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we obtain the interesting non-linear relationship: 

^ = AlnU* = aaiUt-! - c) - a(3(Ut-i) 2 (23) 


U t = (1 - ac)Ut~i + aa(U t - 1 ) 2 - ap(U t ^) 3 (24) 

This is just an example of the sort of model one might use to simulate 
the tendency of the political equilibrium to add supply-side damage on 
top of a bad shock, perhaps from demand; producing a ‘vicious circle’; 
but also of the phenomenon of drastic reform prompted by disaster that 
causes vested interests to set their narrow aims aside. Depending on 
how rapidly the economy improves after such reform, a virtuous circle 
can result as unemployment drops sufficiently for demands for restored 
benefits to disappear. What we are suggesting here is that good macroe¬ 
conomic management has a crucial role in supporting good supply-side 
policy; just as earlier we saw that poor supply-side policy may create 
pressures for inflationary macroeconomic policies. There are intimate 
linkages through political economy between the two sorts of policies; 
and these links have the capacity to create both vicious and virtuous 
circles of economic performance. 


CONCLUSIONS 

In this chapter we have extended our discussions of macroeconomics 
outside the area of monetary and fiscal policy and the ‘demand side’, in 
which the natural rates of output and unemployment are taken as given 
by exogenous supply factors; and we have focused on the determinants 
of these factors themselves. We began by setting out a theory relating 
persistent unemployment (dominated by a large number of long-term 
unemployed) to the interaction of generous long-duration unemployment 
benefits with taxes and other labour market distortions, like union pow¬ 
ers. We then looked at the determinants of growth and productivity, in 
the ‘new theory’ of endogenous growth that has been built on the Solow 
model. Finally, we reviewed some evidence that, paralleling the links 
from poor supply-side performance to inflationary policies discussed in 
chapters 5 and 8, there appear to be links from severe business cycle 
shocks to poor supply-side policies, the most notable example being the 
Great Depression. We suggested that political economy models could 
account for these links too, embodying the tension between voting along 
vested interest lines and voting, perhaps under conditions of crisis, along 
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lines dictated by general welfare. These models should be able to ac¬ 
count for the phenomenon of vicious and virtuous circles in economies’ 
behaviour generated by these linkages. 



10 


The Macroeconomics of an Open 
Economy 


All economies are of course open to trade and capital transactions with 
others; only the world economy is not. So in principle, the models we 
have dealt with to this point are all to be considered as models of the 
world economy. Even the United States, once blithely treated as ‘closed’ 
by the majority of macroeconomists, has been recognized in the past two 
decades at least to be importantly affected by international influences. 

These influences are transmitted through the balance of payments 
which records the transactions of a country’s residents with residents 
elsewhere. The current account records all transactions with non-resi¬ 
dents which alter the net assets of residents, that is, transactions that 
require payment. Exports of goods and services and receipts of factor 
income from abroad (interest, profits and dividends, and remitted rents 
or wages) increase residents’ net worth: if residents do not receive direct 
payment by cash or cheque, then they receive IOUs in some form. Im¬ 
ports decrease residents’ net worth. Exports net of imports, the current 
account balance, therefore measures the change in residents’ net worth 
as a result of transactions with ‘foreigners’, non-residents. This implies 
that movements in the current account have net wealth effects wherever 
in the domestic economy wealth matters — notably spending, possibly 
the demand for money and the supply of labour. 

The capital account of the balance of payments records the capital 
counterpart of all current transactions (that is, payment in some form by 
a drawdown of assets against foreigners) and also all transactions with 
non-residents which do not alter net assets of residents but rather re¬ 
shuffle them: for example, borrowing from or lending to them (a swap 
of a loan liability for cash), equity issues (a swap of the ownership of 
physical assets for cash), direct purchase of physical assets or ‘direct 
investment’ (a swap of physical assets for cash) or combinations of these 
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Assets 

Liabilities 

1 

Physical capital 

4 Government bonds 

2 

Value of cumulated 

(inc. Treasury bills) 


current deficits 

5 Currency in hands of 

3 

Reserves 

banks and public 


Table 10.1: Consolidated bank / government balance sheet 


(like equity purchase financed by borrowing). It follows that the net 
change in the capital account (the rise in net assets) is exactly equal to 
the current account balance. By convention a rise in assets is recorded 
with a minus sign so that the capital and current account balances thus 
sum to zero. 

Within the capital account, transactions with the central bank are 
singled out. The central bank’s net foreign asssets, the ‘foreign exchange 
reserves’, can finance purchases and sales of the domestic currency, usu¬ 
ally for reasons of exchange rate policy. Apart from net interest earned 
or capital gains, which are typically small enough to neglect, the reserves 
will only rise or fall because of these sales or purchases, that is, foreign 
exchange intervention. 

The reserves have monetary significance. If we think of the central 
bank as the monetary arm of the government, issuing money and holding 
reserves for it, then we can write the consolidated bank/government 
balance sheet as in Table 10.1 

Items 1 and 2 are the result of the government’s past decisions to 
invest (1) and to consume more than its revenue (2). These are financed 
by borrowing (item 4) or currency issue (5). Changes in item 3, the 
acquisition of foreign assets (net), must be similarly financed by new 
borrowing or currency issue, given the inherited items 1 and 2. In change 
terms we can write: 


AM = AR + DEF - A B (1) 

where M = currency issue, R = reserves, DEF = the government’s 
investment and consumption less revenue (‘total’ deficit, or ‘borrowing 
requirement’), B = bonds outstanding. DEF and B can be computed 
either inclusive or exclusive of capital gains or losses. The essential point 
of (1) is that if there is foreign exchange intervention, A R, to sell the 
domestic currency, this will increase the money supply, AM, unless there 
is an offsetting open market operation, in this case the sale of bonds, AB , 
for currency. 

We can immediately link this to the exchange rate regime. Under 
fixed exchange rates, the central bank stands ready to maintain the 
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exchange rate with whatever intervention is needed. At this exchange 
rate, the economy will throw up a current account, CURBOP, and a 
balance, CAPBOP, on all capital transactions not involving the central 
bank (or ‘capital balance’ for short). By definition the current account 
balance, being the net acquisition of foreign assets, must of course be 
equal to the balance of net foreign assets transactions recorded in the 
capital account, including transactions in the foreign exchange reserves. 
By the balance of payments convention noted above (an application of 
double-entry book-keeping), assets acquired are recorded with a negative 
sign in the capital account, liabilities with a positive sign. This implies 
that all non-reserve transactions which would increase the reserves, by 
requiring payments from foreigners, are positive, and vice versa. 

Hence 


CURBOP + CAPBOP = A R (2) 

It follows that the supply of money is affected through (1) by what¬ 
ever reserves movements are thrown up at the fixed exchange rate. 

A floating exchange rate is defined by the absence of foreign exchange 
intervention: A R = 0. Hence the rate has to move to whatever level will 
continuously force the current and capital accounts together to zero. 

A useful way to approach the open economy aspects then is to set 
up models of the current and capital account respectively. 


THE CURRENT ACCOUNT: TWO MODELS 

There are two main models of trade flows in the short or medium run. 
The first assumes that products are non-homogeneous and produced un¬ 
der imperfect competition: the products of British firms differ from those 
of German firms, say, even in the same market. The second assumes 
that products are homogeneous and produced under perfect competi¬ 
tion: prices for British and German output of the same product are 
equalized across all world markets (that is, in any given market they 
will be equal, though transport costs will cause prices of both to differ 
between markets). 

Let us take each model in turn, in each case treating home costs and 
prices as given: we will revert to this ‘supply’ aspect shortly. All prices 
will be expressed in home currency so that the exchange rate is implicit 
in the foreign prices. 
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Imperfect Competition 

The imperfect competition model is illustrated in figure 10.1. XX shows 
the demand for exports by non-residents (shifted by world trade, WT ), 
MM that for imports by residents (shifted by domestic demand, D). 
The c f>M curve shows the real value of imports as Ph / Pf (= !/</>) varies. 



Figure 10.1: The imperfect competition model of the current account 

This is the familiar model of import and export demand, encountered 
in the majority of macro models. The resulting current account equation 
is: 


CURBOP = P h X(WT , P H / P F ) - P F M(D, P H / P F ) (3) 


In real terms 


CURBOP 

Ph 


X - (j>M 


( 4 ) 


where (f> =P F /P H is the inverse of the terms of trade. It is usually 
found in empirical work (e.g. by Stern et ah, 1976) that the current 
account improves after a few years when the terms of trade worsen (as 
could occur with a devaluation that is not fully offset by a rise in home 
prices), although in the short run because of lags in the response of trade 
volumes the current account may worsen (the ‘J-curve’). The condition 
for improvement (the Marshall-Lerner condition) can be written in terms 
of elasticities as: 


P E-m 1 > 0 


( 5 ) 
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This applies exactly when trade is initially in balance {X = (f>M) and 
is derived straightforwardly from: 

6M <j> 


n . .CURBOP ,. v 6X <t> 

0 <[<( Ph )/w]-Hx = w - x 


M ' ~ C ‘ +Cm 1 |6 * 
remembering that e rn = - ^ is positive by the elasticity convention. 

Perfect Competition 


The perfect competition model is illustrated in figure 10.2. We take non- 
traded goods (‘home’) prices, Pnt, as given in this case; traded prices, 
Pt, are of course set by world market conditions. A devaluation, not 
offset fully by a rise in home prices, will raise Pt relative to Pnt- Do¬ 
mestic demand, D 1 depending overall on output, y, and wealth, 9 , is split 
between traded and non-traded goods by relative prices, Pnt/Pt- The 
supply of non-traded goods, at given Pnt, rises or falls to meet demand 
for them (in practice market-clearing will require that Pnt move to 
equilibrate the two). The supply of traded goods will depend on traded 
prices relative to home prices (and costs): we will assume throughout 
that capital stocks are fixed (although this can easily be varied to obtain 
long run solutions; see the appendix). 

The current account equation we obtain is: 

CURBOP = P t (Qt[Pnt/Pt ] - D T [D, Pnt/Pt}) (7) 

where we have aggregated over exported and imported products; in fact 
PtQt = PxtQxt + PmtQmt (similarly PtDt), so if there is an im¬ 
provement in the terms of trade, Pxt/Pmt , the current account would 
improve, with no consequence for trade volumes provided Pnt/Pt re¬ 
mained constant — unlike the imperfect competition model. A fall in 
Pnt/Pt (due say to a devaluation not fully offset by a rise in Pnt) will 
unambiguously improve the current account in both the short and the 
long run — unlike the effects of the analogue, a terms of trade change 
in the imperfect competition model. 


HOME PRICES: THE SUPPLY SIDE 

We held home prices constant to study the partial equilibrium determi¬ 
nation of the current account, largely from the demand side. We now 
discuss how home prices in the two models are determined, and also the 
general demand-supply equilibrium of the model. 
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Figure 10.2: The perfect competition model of the current account 


Imperfect competition 

Figure 10.3 illustrates this in the imperfect competition model (it is a 
diagram adapted from the earlier Parkin and Bade diagram of figure 
3.2). In the bottom left hand quadrant, the labour supply curve slopes 
upward as the real consumer wage, W deflated by the consumer price 
index, 7 r, rises. 7r = Pj)P}~ a ' , that is, it is a weighted average of home 
goods prices and imported prices. The demand for labour however reacts 
to the real producer wage, 

W_ = W' JL = W.(PF\l-a 
Ph 7T Ph 7 T P H 

so as ( Ph/Pf ) = e rises, ^ the real producer wage falls relative to 
the real consumer wage and the demand for labour rises, shifting DD 
rightwards. The higher employment is translated, by the production 
function and the 45° line in the top two quadrants, into an open economy 
supply curve, OS in the bottom right-hand quadrant. Finally, we use 
figure 10.1 to derive a curve showing combinations of e and D that give 
current account equilibrium: that is, where the XX and <j)M curves cross 
each other as D is varied. The resulting curve is the XM in the bottom 
right quadrant of Figure 10.3: note that since along it, D = y (implied 
by current account equilibrium), we can draw it in e, y space. 

When these relationships are put together, we can see in figure 10.3. 
that eo, j/o i Lo> (W/n )o are the general equilibrium, conditional on the 
capital stock (we can also let this vary: for this long-run case, see the 
appendix). 
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Figure 10.3: Supply and Demand in the Open Economy under imperfect com¬ 
petition (capital stock fixed) 


Perfect Competition 

Figures 10.4 and 10.5 explain how the perfect competition open econ¬ 
omy supply curve is derived. Figure 10.4 shows the demand for labour 
by traded and non-traded goods industries: the non-traded demand is 
assumed to be more elastic that the traded, because factor proportions 
are more malleable, given the capital stock, than in manufacturing (hotel 
porters versus machine operatives). Since 

7T = P^Pt 

it follows that 


W/ 7T = {W/P N t){Pnt/PtT 

and also that 

W/n = (W/P T ){P N T/PT)~ {1 ~ a) 
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This implies that if Pnt/Pt (which now becomes the real exchange rate, 
e) rises, then at a given W/u, producer real wages rise for the traded 
goods sector, to w' T , but fall for the non-traded sector, to w' NT . Overall, 
under our assumption, the demand for labour would then rise, just as in 
figure 10.3. 



w' T , w' NT show the producer real wage facing traded and 
non-traded goods producers respectively as e rises from 
e 0 to e,, (W/j[) 0 remaining unchanged 


Figure 10.4: The labour market under perfect competition (capital stock fixed) 

Figure 10.5 then takes up the rest of the story, as in figure 10.3. 
In the top left-hand quadrant is shown the production function for the 
whole economy (/) as well as those for traded and non-traded goods: 
clearly one can trace out production in each of these two sectors as well, 
from their demands for labour in figure 10.4. 

The XM curve in figure 10.5 is derived from figure 10.2 by varying 
D and tracing out the Pnt/Pt values for which there is an intersection 
between demand and supply for traded goods (current account equilib¬ 
rium). As with the derivation of the XM in figure 10.3, D = y at these 
values so that we can draw the relationship as between Pnt/Pt and y. 
General equilibrium is at the intersection of the XM and OS curves, at 
eo, 2/o, L 0 , (W/n)o- 
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Figure 10.5: Supply and Demand under perfect competition (capital stock 
fixed) 


The open economy supply curve (OS) and the effect of prices 

The OS curve we have just derived can be thought of as the equivalent 
of y* in the closed economy Phillips curve: that is, it is the state of 
the economy when there are no price surprises (including prices different 
from those built into wage contracts for the New Keynesian case). 

The analysis when there are price surprises is the same as in the 
closed economy case, only here the OS curve shifts in e, y space — a 
presentational difference only. Figure 10.6 illustrates the resulting open 
economy ‘Phillips curve’ effect. 
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Figure 10.6: A price surprise shifts the OS curve: the open economy Phillips 
curve effect 


THE CAPITAL ACCOUNT AND THE REST OF 
THE OPEN ECONOMY MODEL 

We shall see in Chapter 14 (where we deal with efficient markets) that, as 
a result of portfolio diversification, the equilibrium price or return on an 
internationally traded asset is (possibly totally) insensitive to variations 
in its supply. In the case of interest rates, this gives rise to the uncovered 
interest parity condition: 

R t = Rfj — (E t S t+1 — St) (8) 

This safe return on one-period bonds (in domestic currency terms) 
is the benchmark for all other domestic asset returns: if the yield is 
sensitive to the supply of domestic assets, one can add to (8) an extra 
term in the stock of net foreign assets, and possibly also total assets, 
inclusive too of the physical capital stock. However, empirically such 
terms are rarely significant and we ignore them here. 

It is useful to rewrite (8) in real terms: 
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n = Rt - E t ir t +1 + TT t = R F ,t - E t nF, t+1 + 1 r F ,t 

- (E t S t+ 1 + Etirt+i - E t Tr F ,t+i) + (S t +7r t - n F ,t) 

= r ftt ~ E t e t +1 + e 4 (9) 

where r is the real interest rate and e the real exchange rate defined 
in this section as the (log) ratio of the consumer price deflators. (9) is 
the L BB curve’ of the Mundell-Fleming model (Mundell, 1960; Fleming, 
1962) along which the foreign exchange market is always clearing. 

Whether we have the perfect competition model or the imperfect 
competition model, this definition of the real exchange rate will do 
equally well. In the perfect competition model, 7r — 7 Tf = (1 — cx)P nt + 
aP F — 7 Tp (all prices are now expressed in logs). If n F = Pt , then 
7T — TTp = (1 — ck)(P/vt — Pt)- In general n F = SP F + (1 — S)P f ,nt 
and 7r — 7r^ = (1 — a)(.P/vT — Pt) + (1 — 8)(Pt — Pf,nt )• Clearly in 
all cases tt — n F picks up the movement of domestic non-traded prices 
relative to foreign traded prices converted into domestic currency. In 
some circumstances there will also be an error reflecting an equivalent 
movement abroad. 

In the imperfect competition model, 

7r — it F = aP H + (1 — a)P F — SP F — (1 — S)Ph — 

(a + 6-l)(P H -P F ) 

where a > 0.5 < 6. Here 7r — n F is a transform of the terms of trade, 
the real exchange rate in this model. 

In either case we will incorporate coefficients (1 — a) and (a + 6 — 1) 
into the coefficients of e± in the IS, XM and OS curves in the rest of 
the model, to which we now turn. We can modify the IS curve as: 

Ut = -ar t ~ Se t + k0 t + d (10) 

where 8 = net nominal wealth, including now net foreign assets, and e 
enters because of the effect on demand of the current account. The LAI 
curve becomes: 


m t = 7r t + ny t - (3R t (11) 

Now let us ignore government bonds as net wealth; we can then write 
the current account as 


A Q t = —qe t - yy t 


( 12 ) 


Add the definition 


Gt — 7r t + St — 7T F t 


(13) 
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the open economy supply curve: 

yt = &e t + p(n t - Et-mt) (14) 

and the definition of real interest rates: 

n = Rt - (E t n t +1 - 7 i t ) (15) 

Equations (9) to (15) make up a seven equation model determining 
y, e, r, 6, i r, R and either m ( S fixed: fixed rates) or S (m fixed: floating 
rates). The solution can be found by the methods in chapter 2: it turns 
out that the characteristic equation is second order. We will assume 
that the solution is well-behaved, with both forward and backward roots 
stable. 

Here, rather than find the solution algebraically, we use a graphical 
method. We can note (from the solution) that E t e t +1 = je t + £± where 
s t is a combination of the current shocks to the model and j is the stable 
backward root. Now use (9), the BB curve, to eliminate ?’ t in favour of et 
throughout the model. When we do this substitution in the IS curve we 
are obtaining Parkin and Bade’s (1988) ISBB curve, the IS curve when 
the foreign exchange market is cleared. Notice that this ISBB curve 
shifts with dt-i, the previous stock of net foreign assets (we eliminate 6 t 
in terms of et, yt and 6 t ~i in (12). 

The XM and OS curves have been derived earlier. This leaves the 
determination of prices in the right-hand quadrant. Under fixed ex¬ 
change rates (S = S) we can use (13) to obtain prices along the 45° line, 
FX. Under floating, prices are determined by (11) using (9) as follows: 

7r t = m t - ny t + (3(r F<t + E t Tr t + 1 - 7r t + (1 - j)e t - e t ) (16) 

So the slope of the FL curve is (given by the solution E t n t+ i = 
jnt + r/ t r/ t being a combination of shocks, like £t in the analogous 
solution of E t e t +i = je t + e t ) 

Set = 1 +(3(1-j) 

Sn t (3(1 - j ) 

Finally we can illustrate how the model works, with a temporary 
fiscal shock, shifting the ISBB curve temporarily to the right. This 
is shown under floating in Figure 10.7 by the dashed lines. The ISBB 
shifts out the ISBB t . This raises output, shifting the FL curve to F\ L \, 
lowering prices and so shifting the OS curve left to O 1 S 1 . This creates 
a current account deficit (equilibrium in period 1 is to the right of the 
XM curve); note that the XM curve partitions the e,y space between 
falling 6 (to its right) and rising 6 (to its left). Thus 9 falls so that next 
period, the ISBB shifts leftwards to ISBBo. Also in period 2 the OS 
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curve returns to OS\ (7r“ e now being zero), while the FL curve shifts to 
the right with falling output. From period 2, the ISBB-OS intersection 
lies to the left of the XM; so 6 continues to rise (at the rate 1 —j, along 
the stable path) until eventually it reaches the XM curve. 



Figure 10.7: The full open economy and a fiscal shock under floating rates 

The same fiscal shock is shown for fixed rates in figure 10.8. It can be 
seen that apart from the effect on prices, which may fall under floating 
but must rise under fixed, the effects are quite similar. The real exchange 
rate, real interest rates and output rise; and these rises are later reversed. 



Figure 10.8: A fiscal shock under fixed rates 

Needless to say, this model represents just the barest bones of a fully 
specified open economy model. But it should clarify the key elements, 
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before the encrustation of complications from adjustment lags, contracts 
and the rest. 


CONCLUSIONS 

The open economy model is of key importance in applied macroeco¬ 
nomics, since all economies (except the world) are open! This chapter 
has extended the earlier analysis to capture the main essentials of open¬ 
ness. 

The current account determines movements in net foreign assets, a 
key element of national wealth; these impact on spending. The cur¬ 
rent account depends on demand and relative prices; either relative 
home/foreign prices (under imperfect competition) or relative traded/ 
non-traded prices (perfect competition). These relative prices are in turn 
determined by supply conditions interacting with demand: the open 
economy supply curve is derived analogously to the Phillips curve in the 
closed economy, but wages react additionally to the changing relative 
foreign prices. 

The capital account (excluding the central bank’s operations) has 
monetary importance: together with the current account it determines 
the demand for and supply of money. If the central bank does not inter¬ 
vene in the foreign exchange market, then demand must equal supply: 
the exchange rate moves to ensure this — floating rates. Consumer prices 
are then determined by the domestically set money supply, interacting 
with the demand for money. If the central bank intervenes, to hold the 
exchange rate — fixed rates — the reserves will adjust to equal the gap 
between demand and supply: but changes in the reserves will change the 
money supply through the central bank’s balance sheet. With market 
efficiency, the reserves will adjust so rapidly as capital flows to take im¬ 
mediate advantage of any uncovered interest differential, that the central 
bank loses all, even short term, control of the money supply. When all 
these factors are put together, the open economy model shows how the 
economy can be stimulated termporarily, with the real exchange rate and 
real interest rate moving in sympathy with output and the price level, 
but that losses of net foreign assets cause it to reverse this stimulus in 
order to restore the economy’s original wealth. 

It can be seen that under floating exchange rates an open economy 
has essentially the same structure as the closed-economy models we anal¬ 
ysed earlier: the floating exchange rate adjusts to insulate domestic mon¬ 
etary policy from foreign monetary policy. The main differences are that 
the effects of interest rates are reinforced by the real exchange rate which 
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rises with the real interest rate, and that there are additional spillovers 
from foreign shocks. However, the decision to fix the exchange rate 
causes a radical alteration in the behaviour of the economy by subordi¬ 
nating its monetary conditions to foreign monetary policy: its nominal 
interest rates are fixed abroad. 
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APPENDIX 10A LONG-RUN EQUILIBRIUM IN 
THE OPEN ECONOMY 


In the long run, the capital stock is variable. It is natural to assume 
constant returns to scale, since there is a tendency for industries to be 
driven to operate at their lowest cost level by the processes of interna¬ 
tional competition: if there were decreasing returns to scale, industries 
would contract and if increasing, they would expand until constant re¬ 
turns were achieved. As for external returns to scale, there is pressure to 
internalize these externalities and similarly to drive industry operations 
to the minimum cost point: even if the externalities are at the national 
level, forces operate to internalize them through national policy. 

Imperfect Competition 


Assuming constant returns to scale and mobile international capital, 
figure 10.3 is redrawn for the long-run figure 10.9. Now in the labour 
market there is a unique feasible real consumer wage that can be paid 
given the terms of trade and the international cost of capital (f). Because 
Pii = (rPQ 1- ^ and n = P§Pp~ a , therefore: 


W _ W 
~ ~ p%pjr a 


PfP^“ 


(p H /p F ) ( 3- a) f ( 1 -D 


(note that > 1 > a; capital goods are assumed to be imported but 
this does not affect the essential argument). 

As e rises from eo to ei, so the feasible real consumer wage rises; 
labour supply increases. Also pL = [=pQG -1 l also rises, so that labour 
input per unit of output is reduced and the production function shifts 
upwards. The open economy supply curve is accordingly flatter in the 
long run than in the short. 


Perfect competition 


Figures 10.10 and 10.11 show the long run equilibrium of the perfect 
competition model. As in the short run we have n = P,y T “ Pf* - But in 
the long run under constant returns we also have 

P T = W^rPr) 1 - 6 (1) 


P NT = (t\Pt) 1_/3 


( 2 ) 
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Figure 10.9: The open economy under imperfect competition in the long run 


Together these two imply that e = = r 1 Using the defi¬ 

nition of 7r, they also imply that 


TTT/ _ 1-0+cW-S] 

W/ 7r = r 


(3) 


So assuming that n is set by domestic monetary conditions, these 
three equations determine W, Pr, Pnt as functions of 7r and r; in other 
words, r determines real consumer wages (W/t:), the real exchange rate, 
e = Pnt/Pti and real product wages, ( W/Pt, W/Pnt)- Figure 10.A.2 
shows how from e* (given by r), any slight rise in e would lead to an 
infinite expansion of the traded goods industry, and vice versa; assume 
for these purposes that labour is in infinitely elastic supply at some W 
and that Pnt is given by (2) so that non-traded industry expands to 
whatever is required by demand. As demand is varied, traded goods are 
made available by this process (Pt rising or falling) to keep the current 
account in balance. This yields the flat XM curve in Figure 10.11. 
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As r 0 falls to iq, e 0 * rises to e-i* if NT is 
labour intensive 


Figure 10.10: The perfect competition model of the current account in the 
long run 


Turning now to constraints on labour supply, we see in Figure 10.A.3 
how at each e* (corresponding to a certain r) there is a unique real 
consumer wage, (W/n)*; any rise of (W/tt)* above this reduces the out¬ 
put of both industries, and also their demand for labour, to zero, since 
it exceeds labour’s long run unique marginal product. At this (W/n)* 
there is a particular labour supply which defines the level of employment 
and output. If we assume that the non-traded sector is labour-intensive 
(/? is larger than <5), then as r rises, both e and W/n fall. This is the 
assumption made in Figures 10.10 and 10.11, giving an upward-sloping 
long-run OS curve (otherwise it would slope downwards). What we are 
seeing here is the standard Heckscher-Ohlin-Samuelson model result that 
a rise in the cost of capital raises the relative price of the capital-intensive 
industry and lowers the real wage. 
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As r 0 falls to r-i, e * 0 rises to e*-, (assuming NT is 
labour-intensive) 


Figure 10.11: Supply and demand under perfect competition in the long run 








11 

Representative agent models in 
cashless economies 


Macroeconomic models have traditionally been constructed from aggre¬ 
gate demand and supply relationships. These relationships are in turn 
justified by ‘micro-foundations’: that is to say, it is shown that people 
or firms (agents) maximizing their utility or expected utility would on 
average behave according to these relationships. To obtain aggregate 
from individual behaviour, assumptions have been made about the dis¬ 
tribution of preferences and technological idiosyncracies: often, appeal 
can be made to the central limit theorem, which says that the mean 
of N independent random variables has a normal distribution, with a 
standard deviation equal to that of the component variables divided by 

Vn. 

Nevertheless, the parameters of these relationships are only loosely 
tied to the parameters of the preferences and technology of individual 
agents. We saw this earlier when discussing Lucas’ critique (Lucas, 1976) 
of econometric models: certain of the model’s parameters will shift under 
changes of policy regime or other exogenous variable processes. Yet we 
would like to have models which can be used to evaluate the effects of 
just such changes. This argues the need for models whose parameters 
are solely those of preferences and technology, so-called ‘deep structure’. 

In response to this need, a variety of models have been produced 
which make dramatically simple assumptions about preferences and tech¬ 
nology, in order to make a complete treatment practicable. Often tech¬ 
nology is reduced to stochastic per capita endowments of a single con¬ 
sumption good and consumers are treated as identical representative 
agents. 

These models are obviously no use for traditional forecasting and pol¬ 
icy analysis where we try to predict and control particular sequences of 
events. But this is not the use for which they are intended. Rather it is 
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to model an economy in the sense of mimicking and so understanding its 
(average) time-series properties when shocked; from a policy viewpoint, 
this should guide us towards policies which improve those properties. 
This is a long term perspective. With it tends to go the view that 
economists and policy makers should only be concerned with these av¬ 
erage properties over the long term and not with short term sequences 
of events. 

There are horses for courses. In practice, whether they ought to 
be or not, economists are called upon to help in both the short- and 
the long-term aspects of problems. Provided they are honest about the 
shortcomings of the tools they use in both contexts, we can see no ob¬ 
jection to them earning their living doing both. For this reason, we have 
presented the more traditional models already and now proceed to give 
an account of these deep structure models. Inevitably, it is too short to 
be more than an introduction (see Sargent, 1987, for a fuller treatment). 


THE BASIC STRUCTURE OF A REPRESENTATIVE 
AGENT MODEL 

In these models the representative household maximizes expected utility 
subject to its budget constraint; the government spends, levies taxes and 
prints money subject to its budget constraint; and markets clear, impos¬ 
ing general equilibrium. From this structure it is possible to derive rela¬ 
tionships between the stochastic shocks and macroeconomic outcomes, 
such as consumption, interest rates and prices. 

Consumer Maximization 

Take first the household’s decision. In a non-stochastic world it would 
typically be assumed to maximize a time-additive utility function: 

OO 

U =£ A(ct) (1) 

t= o 

Ct is consumption, and u is a well-behaved utility function with posi¬ 
tive and diminishing marginal utility of consumption. Let its budget 
constraint be: 

At+i = Rt(A t + Dt — Ct) ( 2 ) 

where A t is wealth at the beginning of period t, Rt is the interest rate 
(gross, inclusive of capital repayment), and yt is income. To and y t are 
given. 
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Using the Lagrangean method, we can write the maximand as: 

L = u(c 0 ) + f3u(c\) + (3 2 u(c 2 ) + ... + M 0 [Ai — i?o(^o + 2/o ~ c o)] 

+ At i[A 2 — i?i(Ai + 2/1 — ci)] + At 2 [^3 — ^ 2(^.2 + J/2 — ci)] + ... (3) 

yielding the first-order conditions with respect to the consumer’s choice 
variables Co, ci,...; A\, A 2 ,... as : 

SL SL 

0 = — = u'(co) + n 0 Ro] 0 = — = Hq - AU-Ri 

SL 6L 

0 = — = /3u'(c 1 ) + AtiRi; 0 = — = a H - Lh R 2 (4) 

where v! =j L . 

Equation (4) yields a string of relationships between the marginal 
utility of consumption in one period and the next: 

u\c t ) = f3Rtu'(c t +i) (5) 

The household equates its marginal rate of transformation 
u'(c t )/(3u'(c t+ 1 ) 

with the gross rate of interest, a result illustrated in figure 11.1. 

As figure 11.1 and equation (5) suggest, one can split up the con¬ 
sumer’s problem into a sequence of two-period decisions. Given ( A t +y t ), 
he decides c t +\ relative to c t : that is, he can either decide c t+ 1 if he has 
already decided on c t or c t if he must consume c t+ \ . This splitting up 
of the decision problem is known as dynamic programming. I 11 dynamic 
programming, each period’s consumption is first solved given the last pe¬ 
riod’s consumption, wealth and income: finally the initial consumption 
level is set so that all assets are ultimately consumed. 

For many purposes we shall only need to consider the first stage 
of the decision process. Occasionally, dynamic programming solutions 
are presented backwards from the future (that is, consumption in one 
period is solved given consumption in the next period): this method 
is convenient if there is some fixed terminal point from which one can 
work back (at which for example the consumer dies, leaving a fixed or 
no bequest). But it is obviously only a presentational matter whether 
one period’s consumption is seen as depending on last period’s or next 
period’s. 

The consumer decides what to do one period at a time, and in princi¬ 
ple he can recompute his decision for the next period when it comes: he 
decides cq this period (with a plan for ci, c 2 ,. . next period he decides 
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Figure 11.1: Intertemporal consumer choice 


Ci (given Co, then in the past, and with a plan for C2, C3, ...). Of course 
in this problem with no stochastic shocks he will always stick to his orig¬ 
inal plan (this is not to be confused with the time inconsistency of policy 
makers who can influence other people’s decisions and then recompute: 
our consumer only affects himself). He might as well decide at the start 
on his consumption plan and just carry it out without further thought. 

We now turn to a stochastic environment, where in each period a par¬ 
ticular shock is realized. The consumer will have decided on consumption 
in the previous period based on his expectation across all possible shocks. 
He will also have a plan for his consumption in future periods; this will 
be a contingency plan, in which his consumption will depend on which 
shocks occur. Since he does not actually have to decide irrevocably on 
future consumption until the period involved, this contingency approach 
is the optimal one: he maintains his flexibility until the last possible 
moment. Then as the shocks are realized, he picks the relevant branch 
of his contingency plan. Figure 11.2 illustrates. 

We can think of this equivalently as the consumer either recomputing 
his best expected plan each period or as computing at the start a total 
contingency plan and carrying it out as the shocks are realized: the point 
is that the consumer is making use of his potential flexibility in the face 
of shocks by deferring decisions on actual consumption until he has to 
take them. 

The consumer is assumed to maximize expected utility in this envi- 
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Figure 11.2: Contingency plan for consumption, with the shock each period 
taking two values (o, b) 


ronrnent, or 

OO 

U 0 = E 0 ]T ?u(ct) 

t= o 

subject to A t+ 1 = Rt{A t + y t — c t ), where R t = R( 1 + e t ) for example 
but yt is a known series. We can now use the principle of dynamic 
programming and take each period’s decision separately. 

We can write Uo, substituting ci out from the constraints, as: 

Uo = u(co) + E 0 f3u(Ro[A 0 — co] + yi — A 2 /R 1 ) + Eq(3 2 u(c2 ) + ... (6) 

Maximizing this with respect to Co (Aq given) gives: 

0 = ^ = u'(co) + Eopu'(ci)(—Ro) 

0C 0 

or 


Analogously at t 
obtain: 


u'(co) = EofiRou'ta) (7) 

1, the consumer maximises U\ (A t given) to 


tt'(ci) = E\(3R\u' ( 02 ) 


(8) 
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and in general: 


u'(ct) = E t (3R t u\c t+ 1 ) (9) 

which is the expected analogue of (5). By the law of iterated expectations 
it follows that: 


E 0 u'(ct) = /3E 0 Rtu'(ct+ 1 ) (10) 

We can reach this general result more compactly by using expected 
Lagrangeans, taking the expectations operator through the Lagrangean 
multipliers. 

Let the consumer be faced with a given realization A t ; then he can 
choose c t , c t+1 , A t+ i to maximise (at t) U t subject to the constraint, or 
the expected Lagrangean, 

L = Ut + E t {p, t (A t +1 — Rt[A t + yt — Ct]) + 

^t+i(At+ 2 -Rt+i[At+i + y t+ i_c t +i]) + ...} (11) 

C T 

0 = j— = (3 t u'(ct ) + EtHtRt ( 12 ) 

OCt 

SL 

0 = = _ Etih+iRt+i ( 13 ) 

SL 

0 = ^—- = E t f3 t+ iu' (c t +i) + E t n t+1 Rt+ i ( 14 ) 

Equation (9) follows by substitution. 


GENERAL EQUILIBRIUM 


Equation (9) can be turned into a pricing formula for an asset. Suppose 
the asset yields an uncertain dividend, dt+ 1 , and has a current price, pt, 
so that 


Rt = 


Pt.+i + dt +1 
Pt 


(15) 


Then 


u'(c t ) = E t (3( 


Pt+i + dt +1 


)u\c t + 1 ) 


Pt 
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or 


Pt = 


P 

u'(ct) 


Et.u\ct + i)(pt+i + dt+ 1 ) 


(16) 


Since all consumers are identical, the only way this asset will be held 
(by anyone and everyone) is for (15) to hold. 

One way to create a simple economy with an asset market is to follow 
Lucas’ tree model (Lucas, 1978). Let the only asset be an identical 
tree, one initially at t belonging to each consumer, who has no other 
source of income. Let each tree produce non-storable fruit, an all-purpose 
consumption good, in the quantity dt . Let the number of trees belonging 
to each consumer be S t ; the supply of trees per consumer is S = 1. 

Fruit can be exchanged for trees at the price p t (units of fruit per 
tree). But Walras’ Law implies that if the fruit market is in excess 
demand, the tree market is in excess supply. Therefore market clearing 
across the whole economy (trees and fruit) implies that 


c t = d t (17) 

Equations (15) and (16) therefore constitute our model of the econ¬ 
omy, yielding the compact form: 

Pt = E t v! (d t+ ±)(pt+i + d t+ 1 ) (18) 

Substituting successively for E t pt+ 1 , E t pt+ 2 , and using the law of 

iterated expectations (E t Et+i = E t ), yields: 


Pt = E t 


(3u'{d t+ 1 ) , a2 u'(dt + i)u'(dt +2 ) , 

®‘+i+P —77TW771-\ m_|_2 T 


u'{d t ) 


u'(d t )u , (d l 


t+ 1 ) 


- 2^ P ~77T^ dt +J 


3 = 1 


u'(d t ) 


(19) 


Depending on the form of the utility function, one can solve for p t as 
a function of the current and expected d. One convenient case is where 
u{ct) = lnc t so that u'(c t ) = in which case 


Ft 

p t = E t Y J Fdt = -±—d t 

3 =1 ; 


( 20 ) 


In this case the asset price varies with whatever stochastic process drives 
the harvest of fruit. 
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THE GOVERNMENT BUDGET CONSTRAINT 


In this general equilibrium framework with no money, let us introduce a 
government issuing debt, bonds, each of which pays next period one unit 
of the fruit, regardless; b t +\ is the number of such bonds outstanding at 
t (b t the number issued in t — 1). Its budget constraint will be 


9t — Tt + bt — bt+ 1 /Rit ( 21 ) 

where g t = government spending per capita, T t = taxes per capita (an 
equal poll tax), R t = the one-period-ahead rate of return on debt . 

For this debt to be held, it must be as with trees that 


u'{c t ) = /3E t Rtu'(ct+ 1 ) 
Since R t is certain, this implies 

E t u'(ct+ 1 ) 


Rt 1 = 0- 


u'[ct) 


Now market clearing of the fruit market implies: 


( 22 ) 


( 23 ) 


ct + gt. — dt 


( 24 ) 


The consumer’s budget constraint is 

d t S t - a - T t + b t + p t S t = + p t S? + 1 (25) 

-ttl t 

where S t is his existing holding of trees and Sf +1 is his desired holding 
of trees for next period. 

Using the two budget constraints, (20) and (24), and fruit market 
clearing, (23), Walras’ Law reappears for debt and trees together: 

+ PtS t =+ PtSf + 1 (26) 

rt t Cf 

Since debt and trees are perfect substitutes at the R t given by (22) 
and p t given by (18), any excess supply of debt (excess demand for trees) 
is eliminated by an infinitesimal movement in either p t or i? t . 

The Ricardian equivalence result immediately follows in this model, 
that taxes are irrelevant, to consumption from (23), and to interest rates 
and asset prices: 

R -i _ g Etu'{d t + 1 - fft+l) 

* P u'{d t -g t ) 


(27) 
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pt =Y F EtU ' (dt+ > 9t+j)dt+j (28 ) 

u'(d t - g t ) 

Only the path of GDP and government spending matters. This is be¬ 
cause households are infinitely-lived so that the pattern of (lump-sum) 
taxes does not affect their life-time consumption possibilities or perma¬ 
nent income. 


THE PRICING OF CONTINGENT CLAIMS 


We saw earlier (equation 15) that the price of an asset paying a stochastic 
dividend, d t , was: 


Pt 


=m 


u'(ct+i)(pt+i + d t+ i) 
u'(ct) 


(29) 


The price and the dividend is in units of the consumption good, 
‘fruit’ or whatever. Now consider a claim which pays out one unit of the 
consumption good in t, + 1 when the state of the economy, some vector 
Xt+i, has a value between xo and x\. By extension of (28) its value will 
be: 


qt[x o < x t+ i < x\\ 


P 


u'(c t ) 


u'(ct+i[xt + i])f(xt + i,xt)dx t+ i 


(30) 


f(x t +i, Xt) is the probability density function over Xt +1 (given that x t 
has occurred); integrating the area under this function gives the proba¬ 
bility of xt +i lying in the range xo to x\. Equation (29) says that the 
price of a contingent claim is the marginal utility of one unit next period, 
relative to this period’s marginal utility if Xt +1 lies in the range, times 
the probability of its lying in that range. 

Equation (29) allows any contingent claim to be priced. One simply 
specifies the range of contingency, evaluates the marginal utility of con¬ 
sumption in that contingency, multiplies by the pay-off in units of the 
consumption good, and weights each part of the range by its probability. 
Equation (29) can also be derived directly from the consumer’s maxi¬ 
mum problem subject to a budget constraint containing the contingent 
claim: this is left as an exercise for the interested reader. 

Contingent claims which nest other contingent claims within them 
(for example, a claim on two-period ahead consumption given Xt +2 and 
Xt) must be consistent with the claims nested within them; otherwise 
arbitrage opportunities occur. Hence, for example, a claim on consump¬ 
tion two periods ahead must have the same price today as the current 
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price of a one-period ahead claim on consumption two periods ahead 
times the current price of a claim on consumption one-period ahead. 
This is exactly analogous to the arbitraging of interest rates of n-period 
ahead maturity with the n one-period interest rates for the intervening 
periods. 


GOVERNMENT BONDS AS NET WEALTH: MODELS 
WITH INTERTEMPORAL AND GEOGRAPHIC CON¬ 
STRAINTS 

We saw above that, subject to the solvency condition, a government 
could borrow as much or as little as it liked with no effect on the economy, 
assuming its taxes were lump sum (distortionary taxes are another mat¬ 
ter as we showed in chapter 7, where we discussed the Lucas-Stokey op¬ 
timal tax-smoothing proposition) — this is Ricardian equivalence. Yet a 
number of authors have been impressed with the role that a government- 
issued liability could perform by intermediating between people who may 
not be willing to lend directly to each other for some reason: they will 
still be willing to lend to a government which in turn may transfer, or 
lend, to others. Of course, such government bonds will generally affect 
economic outcomes and be net wealth. 

Two main sets of reasons have been advanced why people would be 
unwilling to lend to some other people: one is death (the young will not 
lend to the old because the old have no incentive to pay it back after 
death), the other is geographic isolation (members of one tribe will not 
lend to members of another if they cannot see them again to reclaim the 
debt). 


OVERLAPPING GENERATIONS MODELS 

Samuelson’s overlapping generations model, used (see chapter 7) by 
Barro (1974) with a bequest motive in order to re-establish Ricardian 
equivalence, has been used extensively by Neil Wallace and his colleagues 
under the assumption of inter-generational indifference (no bequests) to 
establish a role for government liabilities. 

Suppose all generations are made up of N identical agents, whose 
income stream in perishable consumption units is y — e and e in their 
youth and old age respectively: 0 < e < y/2 so that they obtain more 
income when young than old. Consumption when young and old is 
respectively c^(t) and c^(f+l) for the hth agent of generation t. Assume 



282 


Extending and Deepening the Models 


there is no investment opportunity other than a loan market, but that 
the government has no borrowing, tax or spending programme initially. 
Then we can easily show that there will be no lending by agents when 
young, at all: the old will not lend (because they will not be alive to 
be paid back) or borrow (because they cannot pay back when dead). 
Assume the consumer maximizes a logarithmic utility function 

Ut(d?(t),d?{t + 1)) = In c't (t) + In c'f(t + 1) (31) 

subject to 

y-£~Ct(t) = It 

£+[l +r(t)]lt = Ct(t + 1) (32) 

where r(t) is the net rate of interest, to be determined by the clearing 
of the loan market (this by Walras’ Law also clears the goods market). 
His Lagrangean is therefore: 


J = ln c h t (: t ) + In c' 1 (t + 1) + n't (t) [y-e- $ (t) - %] 

+ Mt (t + l){e + [(1 + r(t)]lt - Ct(t + 1)} (33) 


The first-order conditions yield: 


h(f) _ c?(t + l) 

1 +r(t) 

The consumer’s life-time constraint is: 

<$(!+ 1) 




1 + r(t) 


= y~e + 


1 + r(t) 


so that 


(34) 


(35) 


h , . y — s e 

c t (*) - 2 + 2(1 +r(t)) 

and 

,h = y- £ _ g / 2 

2 1 + r(t) 


(36) 


(37) 


Market-clearing requires that H = 0 anc ^ hence l ^=0 since all 

h 

agents are identical. Consequently 

1 + r(t) = —-— < 1 (38) 

y- £ 


Negative interest rates are required to induce people to consume all 
of their youthful income, since no one is available to make a loan to. 
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Now let the government borrow (a one-period loan) to spend in the 
first period only, the sum G(l) = iV[| — e]; it pays off its debt from taxes 
(r) on the young in period T. Its budget constraint is: 


G(t) + L 9 (t) + [1 + r(t — l)\L 9 (t — 1) (39) 


where L 9 (t) is the loan it takes out in t. and it starts with L 9 { 0) = 0. In 
this case, this implies 


G(l) 

L 9 (t) 


-l®(i) = jv[|- 

[1 + r(t - 1 )}L 9 (t 


1) {T>t > 2} 


and 


J}4(T) = -[1 +r(T- 1)]L 9 (T- 1), setting L 9 (T) = 0 

h 


The loan market equilibrium in period 1 is now: 


0 = L®(l)+£tf 

h 


= —N 


\ V --e] 

+ N 

12 



y-£ 

2 


g/2 

1 + r(l) 


(40) 


(41) 


which is only satisfied if r(l) = 0. This government intervention has 
therefore raised interest rates by siphoning off the young’s expendi¬ 
ture into loans. Subsequently, the government debt remains constant: 
L 9 ( 2) = L 9 ( 1), which implies that r(2) = 0; and so on until T, when 
L 9 (T) = 0 but the tax bill reduces the income of the young, and 0 = 
gives 1 + r(T) = . Thereafter of course the equilibrium gives 

r(t) = ^ (t > T) as in (37). 

The implications for consumption patterns of this intervention are 
beneficial to every generation except the Tth and beyond. Generations 
less than T now consume: 

c?(t) = l = ct(t + l) (42) 

whereas previously they consumed y — e and e respectively in t and t + 1. 
Figure 11.3 illustrates the improvement in their welfare. The 45° line yy 
shows each generation’s consumption possibilities when young and old, 
provided there is some mechanism (like the government loan sequence 
above) to ensure that when it transfers resources to the old, the same 
will be done by the next generation to it. Clearly the optimal point is 
along the 45° line from the origin. 

The Tth generation is worse off, as its consumption pattern becomes 
| in youth (because of the tax bill) and in old age. The following gen¬ 
erations are of course unaffected. 
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Figure 11.3: The welfare effect of consumption smoothing on the OLG model 


This would therefore appear not to be a Pareto-improving policy 
across all generations. However, in an economy with an infinite life, it can 
be made so by deferring repayment of the government debt indefinitely: 
the Tth generation then never arrives. In this case the government bond 
issue is a source of net wealth to the country, even if the initial output 
diverted to government spending, G(l), is thrown away! 

The gain, to repeat, lies in the ability of government bonds to ef¬ 
fect loans from the young generation to the old. The young invest in 
government debt, the old pay it off; but they do not have to deal with 
each other directly. Notice that this government loan sequence effects 
exactly the same as a pay-as-you-go pension scheme, taxing the young 
by y — e— | and giving this as a pension to the old; so here a loan market 
in government debt is all that is needed to enable people to make their 
own pension provision efficiently. 


GEOGRAPHY: THE BEWLEY TOWNSEND 
MODEL 


A similar point about the usefulness of government debt can be made 
about two communities, each with uneven income patterns over time, 
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but which are separated in space rather than by time as the above gen¬ 
erations are. Let community A people have an income stream in per¬ 
ishable consumption units of y — e and e (a variable harvest say) while 
community B people have the stream e and y — e, in even and odd pe¬ 
riods respectively. All A and B people share identical preferences. But 
the A people will not lend to the B people, or vice versa, because the 
two groups rarely meet (they may trade but one group having traded, 
their next trading session is with another group); Bewley (1980) and 
Townsend (1980) suggest a ‘turnpike’ with A and B people passing each 
other in opposite directions, meeting once but never again. The A and 
B people both live for T + 1 periods (where we can allow T —> oo); 
assume there are N each. It is obvious that A people will consume in 
even and odd periods y — e and e respectively, B people e and y — e; 
their consumption pattern will be as uneven as their income pattern. As 
with our overlapping generations model, a government which borrows 
can smooth their consumption. 

Write its budget constraint per capita as: 

9t + It = T i + (1 + r(t ~ (T > t > 0, given l 9 _ x = 0) (43) 

The government levies the same tax, r* on everyone; gt and If are 
per capita spending and one-period loans. 

Each hth consumer maximizes 

T 

p' u ( c t) 

t =0 

subject to 

c h t + If < yf + (1 + r(t - 1 ))#_, (T > t > 0 
given = 0). 

The market equilibrium in loans is: 

\l? + \lf+l 9 t= 0 ( 44 ) 

Let us for maximum simplicity assume the government merely acts 
as a lender and borrower, and does not use its tax or spending powers. 
Then If = 0 (net) for all t and it follows from market equilibrium that 

cf + cf =y (45) 

The consumer’s optimum in the usual way yields: 

u'(ct + 1) 


(3{l+r(t)) 


(46) 
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Hence since from (44) cf = y — cf 

u'{cf) = u’(y ~ c-t) 

u '( c ?+ 1 ) u’(y-cf + 1 ) 


/3(1 + r(t)) 


(47) 


It follows that cf = c^and 1 + r{t) = full consumption smooth¬ 
ing with the rate of interest equal to the rate of time preference. Imposing 
the terminal condition that all loans must be paid off (so that the present 
values of consumption and income are equal), and letting T — > oo, yields 

cJ -V-* 1 '® (48) 


1-/3 


1 -/3 Z 


This is the result of equating the present values of As constant infi¬ 
nite consumption stream and of As alternating infinite income stream, 
starting y — e. Hence 


, A _y- U 1 - P) 

1 + /3 


and 

b _ /3y + e(l-/3) 
C 1 + /3 


(49) 


This outcome is illustrated in figure 11.4, a box diagram with As 
preferences running from the bottom left-hand corner and B’s from the 
top right. Because of positive time preference, their indifference curves 
have a slope of —/? along the 45° line between these two corners (compare 
Figure 11.1 where the slope is —1); A and B agents being identical, their 
contract curve lies along this 45° line. The autarchic point is at a; the 
optimum at b is where the budget line going through a with the slope 
— (1 + r) -1 = —/? cuts the contract curve. 

The point about the role of public debt here is that it enables trade 
(‘geographic smoothing’). With each community made up of infinitely- 
lived households, then government borrowing (and its tax pattern over 
time) will make no difference to the pattern of consumption and asset 
prices — that is, there is Ricardian equivalence. Government debt here 
creates net wealth only to the extent that it facilitates trade: it is act¬ 
ing like the capital account of the balance of payments. However, if 
the capital account is already operating, this effect of government debt 
disappears. 


THE REAL BUSINESS CYCLE MODEL 

In chapter 3 we referred to the research agenda of ‘real business cycles’ 
(RBC), in which the shocks driving the business cycle are identified as 
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The 45° line is the contract curve, b is 
the optimal feasible point, given 
endowments 


Figure 11.4: The Bewley-Townsend model 


coming from productivity and tastes rather than from monetary policy; 
the reason being that rational agents with access to good up-to-date 
information will not either sign nominal contracts or suffer information 
lags about prices sufficient to generate a forward-sloping AS curve — in 
effect the RBC AS curve is vertical. Even though nowadays we would 
generally ‘augment’ such a model with nominal rigidities that would 
give monetary shocks a role, this uncompromising agenda has produced a 
large volume of work and insights into the economy’s behaviour following 
Kydland and Prescott (1982), the prime instigators: a large selection is 
provided by Hartley et al. (1998). 

In early post-war macroeconomic thought productivity growth was 
treated as a smooth and predictable ‘deterministic trend’, that is a (nor¬ 
mally loglinear) time trend; so were changes in tastes that for exam¬ 
ple through changing labour supply affected output growth. Thus the 
economy could be thought of as a time trend of ‘potential output’ with 
business cycle movements around it: 

Vt =7 1 + Vo 


(50) 
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The movements around it were seen as due to demand shocks. The 
RBC model denies this; shocks to potential output are random walks 
with drift (with possibly additional serial correlation which we ignore 
for simplicity) 

&Vt = 7 + Vt (51) 

Here 7 is the deterministic trend (the drift), but the key point is that 
potential output is also constantly being shocked upwards or downwards 
by permanent changes in tastes (leisure preference) and productivity (for 
example upwards by innovation, downwards by a new technology that 
outdates existing capital). If we integrate (50), we obtain: 

0 

Vt = 7* + Vo + rit ~i ( 52 ) 

i—t — 1 

which shows that potential output reflects the cumulant of past shocks 
to technology and tastes (dominated probably by productivity shocks to 
the production function). 

When one thinks about the matter, it is clear that this rather than 
(49) must be correct, since technological change is by definition unpre¬ 
dictable and yet once it has occurred its effect is permanent and (unless 
and until some new change) irreversible. The seminal piece of empirical 
work that showed (50) fitted the facts better than (49) was Nelson and 
Plosser (1982). 

Given that the progress of potential output is random, this source of 
shocks can also disturb the economy in the short run, with the assistance 
of propagation mechanisms. First, investment in the new technology will 
be spurred by its arrival; the additional capital will take time to build, 
generating a delayed stimulus to demand. Second, consumer-workers will 
adjust their consumption and work-plans in response to the new income 
prospect and real interest movements. The business cycle results from 
the interplay of these reactions within a market-clearing environment 
(that is, one where agents are free to make all mutually-beneficial trades). 

Plainly there is nothing to stop us adding nominal rigidities to this 
model; but RBC theorists reject these additions as theoretically ill- 
founded since people will not wish to subject themselves to additional 
(money) shocks when they can set prices in relative terms, either by re¬ 
acting promptly to any changes in money prices or by indexation. RBC 
modellers claim their models can capture the properties of the business 
cycle without these additions. A large empirical literature has grown up 
around these claims, using techniques that are generally different from 
classical regression: in particular RBC modellers reject forecasting as a 
valid test of a model. A model should be a mock-up of the economy, 
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to provide insight into how the economy works; thus its ‘cyclical prop¬ 
erties’ should be like those of the economy. This does not imply one 
can forecast the economy well from a given initial situation, because one 
cannot forecast the shocks that will occur. RBC modellers favour com¬ 
paring the unconditional moments of the model’s simulations with those 
of the economy: especially the second moments, the variances and co- 
variances of key variables like output, employment, interest rates, wages 
and prices. 

We now write down a simple RBC specification. So far our represen¬ 
tative agent models have taken the capital stock (e.g. ‘trees’) as given: 
income has come as ‘endowments’ or ‘fruit’. Nor have we included labour 
supply. Essentially we have explored models designed to shed light on 
particular issues — such as asset pricing and the wealth effects of gov¬ 
ernment borrowing; these models have had both infinitely-lived people, 
equivalent (as in Barro, 1974) to overlapping generations of mortals who 
care about their predecessors or their successors or both, and overlap¬ 
ping generations who did not care either way. RBC models draw on 
both; but the main group assume the former. 

So we begin with with a representative household maximizing ex¬ 
pected utility from consumption, Ct, and leisure, 1 — L t , at time 0: 

Eo f 

So 1 -<’ 

where this utility function, usually chosen, is of the Constant Relative 
Risk Aversion (CRRA) type, with a Cobb-Douglas relation between con¬ 
sumption and leisure (7 is the share of market activity in the total of 
consumption plus leisure value and a* is a (leisure) preference shock). 
All units in the model are per head of population. 

The representative firm, owned by the representative household, has 
a Cobb-Douglas production function with a time-to-build lag between 
the decision to invest in a capital project and its appearance as capital. 
The household chooses to consume or to invest in either real bonds or 
the firm’s capital projects. Hence its budget constraint is 

ct+l t +b t = A t K?L\~ a - T t + bt- i(l + r t - 1 ) (54) 


where 

4 

A = y, ki.s t _ i+4 (55) 

2=1 


K t = (1 - 6)K t _ 1 + St—4: 


(56) 
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We have assumed that projects take four periods to complete. Here 
St —4 for example is the project started at t — 4, which is then completed 
in four parts, k\ — k±, each of these being a proportion. Thus s t -4 is 
completed by t, when it appears in the capital stock, A' t ; besides this 
addition there is depreciation 6 . I t , investment, is the sum of the spend¬ 
ing on projects in train: k^St, k^Sf-i, k 2 S t - 2 , fcis t _ 3 -It is assumed that 
a project once started is then taken through to completion: naturally 
enough because of the large sunk cost of a completed section. 

Finally, we have a government which spends 011 necessary public 
goods and raises taxes: 

bt = Gt — T t + 6 t _i(l + (57) 

Market-clearing in goods is: 

c t + I t + G t = A t K?L\- a (58) 

This model is highly nonlinear and cannot be solved analytically, 
except via a (perhaps unacceptably inaccurate) linear approximation. 
To illustrate the ideas of such a model we use a simpler set-up in which 
capital is continuously variable with a one-period lag of installation. This 
changes the consumer’s budget constraint to: 

c t + I< t - (1 - S)K t -1 +bt = A t K?_ x L\- a -T t + 6 t_i(l + r t - 1 ) (59) 

We will also simplify by assuming that households’ decision to sup¬ 
ply labour is predetermined (perhaps by some decision made a long time 
before on patterns of education and work); hence L t itself becomes the 
preference shock and we drop a*. The Lagrangean the household maxi¬ 
mizes at time 0 now becomes: 


A = A 0 f > t {c ^ (1 1 Lt)1 7}1 P + A t{c t + K t - (1 - 6 )K t -i 

,u 1 ~P 

+ b t - A t K«_ x L\~ a +T t - b t - i(l + r t -i)} (60) 
The first-order conditions are: 

0 = - LO'-TVr 1 + At} (61) 


0 = A 0 {At-A t+ i(l + r t )} (62) 



(63) 

(64) 



Representative Agent Models in Cashless Economies 


291 


Eo(a\iAi)L\ 
{E 0 Xi(r 0 + 6)}^ 


(65) 


where 


Ai 


7/3(1 - 

1 - 7(1 -p) 
C 1 


Taking logs of this we obtain: 


lni^o —InT/ q(q:Ai Ai) 1 ~ a L\ 


1 

1 — a 


ln(r 0 + 5) - 


1 

1 — a 


In EqXi 


( 66 ) 


Let us now assume that A In A t = r/ t and A In L t = e* where both 
these errors are normal and iid. We can now make use of the fact that 
when In Z t is normally distributed with an innovation x t then approxi¬ 
mately 


In Et-iZt = E t -i In Z t + 0.5 var(x t ) (67) 


Hence 

In K 0 = —i— In A 0 + In L 0 - —^— ln(r 0 + S) + K 0 ( 68 ) 

1 — a 1 — a 

where 

A'o = —-h 0.5var{(aAiAi) TTr “Li} — var(Ai) 

1 — a 1 — a 

is a constant which depends on the variances of the two errors and their 
covariances with A. 

We can generalise these conditions by letting 0 = t (in other words, 
the period 0 we have been planning from can be any period, t; the plan 
will then treat 0 as t, 1 as t +1 and so on). Hence we can generalise (67) 
as: 

In K t = —!-— In A t + In L t - — ln(r t + 6) + K 0 (69) 

1 — a 1 — a 

It follows from the production function that: 

In Y t = In A t + (1 — a) In L t -\ -In + a In L t _ i 

1 — a 

- ln(r t _! + 6) + aK 0 (70) 
1 — a 

To solve for interest rates we use the first-order condition for con¬ 
sumption (63) (letting 0 = t, 1 = t + 1) with ( 68 ) for the capital stock 
and (69) for output, the exogenous process for government spending and 
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substitute them all into the market-clearing equation (57). This is a 
nonlinear expression in which the expected path of interest rates as seen 
from period t must ensure that markets clear given public spending plans 
while consumption is intertemporally smoothed via investment and bond 
holding. This path (most accurately found via a computer algorithm) 
will be approximately a saddlepath like our solutions in chapter 2 with 
forward-looking expectations; plainly there is scope here as there for long 
drawn-out dynamics from the interaction of adjustment and rational ex¬ 
pectations. It is possible to make some rough linear approximations but 
we do not pursue that here — enough has been done to show that one 
can obtain solutions for output and the other key macro variables ex¬ 
hibiting both variability and persistence that in principle at least could 
mimic a real economy — as explored in the empirical literature. 


CONCLUSIONS 

In this chapter we have examined the behaviour of representative agent 
models in a cashless society. We have seen that where loan and other 
contingent claim markets are complete, these claims are priced so that 
the expected future discounted marginal utility of consumption equals 
its current marginal utility. Since all agents are identical, consumption 
must equal (perishable) output in every period. If a government enters 
the market, then Ricardian equivalence holds: only government spending 
affects consumption and asset yields. Government debt is irrelevant to 
real outcomes. 

Government debt becomes relevant if some constraint on market com¬ 
pleteness prevents optimal loan trades being made between agents. Two 
such constraints were considered: overlapping generations without a be¬ 
quest motive where the young will not lend to the old; and communities 
which are spatially separated so that loans cannot be reliably recov¬ 
ered. In both cases a government which borrows to finance a deficit can 
achieve intermediation between surplus and deficit agents. In the first 
case, Ricardian equivalence is eliminated; in the second case Ricardian 
equivalence still holds for the pattern of government borrowing over time 
but government willingness to borrow and lend enables trade to occur, in 
effect proxying the capital account of the balance of payments (chapter 
13); of course if capital transactions are already possible, this role for 
government debt evaporates. 

We ended the chapter by setting up a Real Business Cycle model 
whose aim is to mimic the economy’s cyclical properties without appeal¬ 
ing to anything other than maximizing behaviour by entirely rational 
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and well-informed agents — it is assumed that they either are well- 
informed about general price movements or can easily index their own 
wages and prices to general prices. In this model government policy only 
matters for the business cycle to the extent that government spending 
fluctuates: there is Ricardian equivalence so that tax rates are irrelevant 
and monetary policy is entirely ineffective. The literature exploring this 
model empirically has not surprisingly generated considerable contro¬ 
versy which continues on many fronts. 
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APPENDIX 11A THE TECHNIQUE OF DYNAMIC 
PROGRAMMING 

Suppose we write a maximization problem at time t = 0 as follows: 
Maximize at t = 0 

T 

M = ^2 P tr ( x t, Ut ) + P T+1 R{xt+i) (1) 

o 

where (xo is given) 

x t +i =g{x t ,u t ) (2) 

The r function is the value or return produced in period t; R is the 
terminal r function, the value in the last period, which the agent is as¬ 
sumed to be unable to affect with his instrument from T + 1 onwards. 
x t is the state (e.g. of the economy) variable; it may be a vector of 
variables but we will treat it for simplicity as a single variable. u t is the 
variable to be used as the instrument of maximization. The g function 
is the model of the economy relating the state at t + 1 to the previous 
state and the instrument. There is no uncertainty. 

The dynamic programming method is to maximize M in two-period 
segments, taking the results from other periods as being already max¬ 
imized. It thus breaks down the problem into T problems. For conve¬ 
nience, start with segment T : 

Maximize 

(3 T [r(x T , u T ) + (3 R(x t + i)] (3) 

This takes xt as given (in effect by separate maximization of pre¬ 
vious segments) and, since there is no ut+ i by assumption, it must be 
maximized by choice of Ut ■ This gives rise to the following first-order 
condition: 

dr(x T u T ) dR T +idg(x T ,u T ) 

du T P dx T +i du T 1 ’ 

Since r, g and xt+ i are all functions of xt and ut, this gives us a 
maximizing solution for Ut in terms of x t- 

ut = Kt{xt) (5) 

Denote the maximizing value of Xt+ i correspondingly Xt+ i- Then 
our T segment becomes: 

[r(xT,ur) + (3 R{xt+ i)] = Vt(xt) 


( 6 ) 
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Vt(xt) is the ‘value function’ as seen at time T; by this is meant the 
function which, given xt, extracts the most value from subsequent uses 
of the instrument (i.e. here Ut)- 

We must now repeat the operation for the T — 1 segment: 

[r(xT-i, ut-i) + (3Vt(xt)\ (7) 

where Vt{xt ) is the present value at T of the last, T, segment since this 
part is what needs to be maximised at T — 1, having had its maximizing 
ut chosen (and so has been reduced, after solving for u T in terms of xt, 
to an expression solely in xt)- Thus we maximise at T — I an expression 
that already allows for the effect of future maximization at T. Again we 
obtain the first-order condition: 

dr{x T -iUT-i) , dV T dg{x T - U UT-i) , Q , 

°" + / W (8) 

This again solves for: 

ut -i = hr-iixT-i) ( 9 ) 

Again we can rewrite the T — 1 segment as: 

[r(xT-i,UT-i) + @Vt(xt)\ = Vt-i(xt-i) (10) 

Vt-i(xt-i) analogously is the value function seen at T — 1; plainly it 
includes the value function at T within it, discounted by (3. We may 
continue backwards along the time segments obtaining: 


[r(XT-2, Ut-2) + ( 3 VT-l{XT-l)\ = Vt-2{xt-2) 
[r(xT-3,UT-3) + 0Vt-2(xt-2)] = Vts{xt- 3 ) (71) 

[r(xo, uo) + (3V\(x\j\ = Vo(x 0 ) (11) 

Vb(xo) is the maximised value of M, that is, when the whole path of 
xi, X 2 , , xt +i has been maximized by the path of uq, ui, ut- 

This segment-by-segment technicpie is highly convenient for thinking 
about uncertainty. It is usual to assume that at each period the agent 
can change his current and future instrument settings. If we think of 
him maximizing the expected value of (1), then he will at T choose a ut 
that maximizes the expected value of (4), and (6) will be the discounted 
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value function, as expected at T ; this will be his contingency plan for 
T. Vo(xq) will then be the expected value function at t = 0; and within 
it will be nested expectations at t = 1, t = 2, t = T. We can then 
use the law of iterated expectations to convert the expression into an 
expected value at t = 0. 

Reverting to the case of no uncertainty, we can see that it implies in 
(9) an optimising rule relating the instrument to the state. Plainly if we 
can find this rule, then we can substitute it into M in (1) and so find the 
value function, the maximum value our agent can obtain. Such a rule, 
which in general will not be time-invariant, would be: 

Ux = hx{xx) ( 12 ) 

In the case of uncertainty, this rule is to be found from the expected 
first-order condition and it becomes a rule for a contingency plan relating 
the planned instrument to whatever the state then turns out to be: thus 
ux becomes the contingent plan value of ux■ If we take expectations of 
(12), it will give the expected instrument as a function of the expected 
state. 

In practice, with models of any complexity there are no available 
analytical techniques for finding the contingent-plan rule or the value 
function: the problem has to be solved numerically via the computer. 
In this respect, the situation is worse than for the linear rational ex¬ 
pectations models we mainly considered in chapters 1-7; for these the 
analytic techniques exist and can be written down, even if in practice 
they too are usually found by the computer. For dynamic programming 
problems the form of the contingent rule has to be guessed in the first 
place; to minimize complexity, set-ups are often converted into linear 
models with quadratic maximands, for which the form of the rule is 
known. However, most of the representative agent models being used 
in modern research are highly non-linear; so such a conversion carries a 
cost in brutal approximation. 

To gain an understanding of solution paths the computer will gener¬ 
ate, it is helpful to work through a simple set-up. 

Let a household maximize 

T 

y: /?* in ct ( 13 ) 

t—0 

subject to 

At+i = Rt{At ~ Ct) (14) 

Ax+i = 0; v4o, Rt(t = 0, 1,... , T) are given. R t is the gross real 
rate of interest. 
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Here the household has a finite life and will leave no bequest. We keep 
its problem to the simple one of deciding how to spend an initial stock 
of wealth, through its lifetime. In terms of our previous notation, we 
define the state as A t and the instrument, u t , as the amount of wealth 
unconsumed in period t, or ‘savings’ for short: u t = A t — Ct . Thus 
At+ 1 = Rt.ut- 

Starting with the last period, T, plainly there will be no savings: 

0 = At+ i = Rt(ut) so that 

ct = At ( 15 ) 

(a corner solution where all remaining assets are consumed). 

The value function at T is therefore: 


Vt(A t ) = In Ct = In Ay — In Rt—\Ut—\ (16) 

We now maximize the T — 1 segment: 

ln(Ay_i — ut— i) + 0lnR T -iUT-i (17) 

with respect to ut-i ■ 

The first-order condition is (in the standard way equating the margin¬ 
al utility of consumption today with (3 times the marginal utility of 
consumption the next period times the real interest rate): 

0= —— - + _g_ = ^-l-(l + <?)«T-l (18) 

At— 1 — llT —1 UT —1 Ut—i{At—1 — UT~ l) 

so that 

U T-1 = 1 + p A T -i (19) 

that is, 

Ct ~i = i + p AT ~ l ( 20 ) 

The T — 1 value function is now: 

Vy_i(A T _i) = In j—^Ay_i^ +/31n ^Ry-i ^ ^ p Ar-i'j 

= (3 In [3 — (1 + (3) ln(l T /?) + f3 In Rt— i 

H- (1 + /5) In Ay_! (21) 


Now maximize the T — 2 segment with respect to Ut -2 : 
ln(A T _2 - ut- 2 ) + (3[f3 In (3 - (1 + (3) ln(l + (3) +/31ni? T -i 
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+ (!+/?) ln(i?T—2 Mt— 2 )] 


The first-order condition yields analogously: 

(3(1 + (3) 1 

UT - 2 = TTWW) T - 2 ° r CT - 2 = TTWTW) T ~ 2 


( 22 ) 


Proceeding now to maximize the T — 3 segment with respect to ut -3 
yields: 


Ut -3 


f3[l + /?(! + /?)] 

1 + f3[ 1 + /?(! + /?)] 


At-3 or ct ~3 


. 

1 + /3[1 + /J(1 + /J)] T ' 3 

(23) 


Hence we discover that the ratio of savings to wealth starts very close 
to unity, declines slowly at first, and then plunges sharply in the final 
years of life. Notice that even in this simple set-up the reaction rule 
is not constant, the reason being that the end-of-life constraint forces 
wealth to be completely used up; in order to smooth consumption the 
share of wealth consumed must rise over the lifetime (figure 11.5). 


0/, iif/At ■ 


1 . 00 - 
0.75 
0.50 
0.25 -U 





T-3 T-2 T-1 T 


C/A, 


-u t /A t 

> 


Figure 11.5: Consumption of wealth over lifetime 


To find a constant rule requires a problem that does not change over 
time; an infinite lifetime in this case. If we now change the above problem 
to an infinite horizon one of maximising at t = 0: 

OO 

(24) 
t =0 


subject to 


lirn A t = 0; A t+ 1 = R 0 (A t - c t ) = R 0 Ut 

This assumes that the implied one-period interest rate at f = 0, i?o, 
also applied at all future dates (that is, there is a flat term structure). 
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We can now guess at a constant reaction function and the corresponding 
value function. Let us try for Vr the same logarithmic form as the 
consumption function (since consumption will be related to wealth and 
therefore welfare at T related to the log of wealth): 

Vt(At) = k In At + L’o (25) 

and 


ut — aA T 


(26) 


where k, ko and a are unknown and to be found out through the method 
of undetermined coefficients. To check our guess we derive the first-order 
condition and then implied value function; if correct, we can solve for the 
unknown coefficients when we compare these functions with our guessed 
ones. Using our guess, we maximise with respect to ut the T-period 
segment: 

Vt{A t ) = In + (3Vt+i(A t +i) (27) 


where Vt+i(At+i) must be given by (25). 

We find the first-order condition of (27) as: 


ut = 


/ 3k 

l + (3k 


At 


(28) 


We now substitute this into (27) and compare the result with (25): 


Ir ( At) — In 


= In 


1 


-Ai 


1 /3k 

1 

TTfik' 


(3khi(R 0 u T ) + /3ko 
(3 k 


A t + (3k In R 


1 + (3k 


At + (3ko 


= (3k In (3k — (1 + (3k) ln(l + (3k) + (3k In R 0 + (1 + (3k) ln( At) + (3ko 

(29) 

When set equal to the guessed solution, (25), this yields our undeter¬ 
mined coefficients as: 


k = 


1 


1 -(3 


and 


(3k In (3k — (1 + (3k) ln( 1 + (3k) + (3k In R 0 

= 

(3In(3 + (1 - (3) ln(l - (3) +(3lnRo 

o^W 2 



300 


Extending and Deepening the Models 


Pk 

a = TTTk =l3 

Hence we have successfully found a constant rule and value function: 

ir ( a \ 1 , 4 , /? In /? + (1 — /?) ln(l — /?) + /? In f? 0 

U(^ T ) = In +- n —- (30) 


ut — (3 Aj' 


ct = (1 - /?)^t 
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APPENDIX 11B USING DYNAMIC PROGRAMMING 
TO CHOOSE THE OPTIMAL MONETARY POLICY 
UNDER OUTPUT PERSISTENCE 

In chapter 5 we considered the issue of choosing optimal monetary pol¬ 
icy and of time inconsistency. Because the Phillips Curve had no output 
persistence, the problem could be solved using static optimization. How¬ 
ever, suppose that the Phillips Curve exhibits persistence as follows: 


Vt = PVt-i + ot(n t — 7Tj) + e t (1) 

where 7Tj is the rational expectation of inflation for t formed with t — 1 
information; it is also under commitment the inflation rate chosen and 
announced by the monetary authority. 

Commitment 

The set-up is that the central bank has scope to react to shocks- implic¬ 
itly because the wage contract underlying this Phillips Curve is longer 
than the publication/reaction time to the shock. Using the usual loss 
function the value function under commitment is: 


V(y t - 1 ) = Max{wrt'K t ,-K e t )E t _ 1 {-{).b{ix t - tt*) 2 - 0.5A(y t - y *) 2 

+ pV(y t )} ( 2 ) 

The value function form that works for the quadratic loss is also 
quadratic: 

v ( y) = 7o + 7i + 0.57 2 y 2 (3) 

Under commitment there is an additional constraint that the policy 
instrument choice at t — 1 is also to be rationally expected because it 
will be followed through: 


and also 


Et-Wt = 7T( 


(4a) 


7 r t = Et-int + be t (4b) 

which uses the property of rational expectations and b is an undeter¬ 
mined coefficient (here it will be the chosen maximising response of 
monetary policy to the supply shock). 
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Hence maximise 

L t {= {-0.5(7r t - 7T*) 2 - 0.5A (y t - y*) 2 + (3V(y t )} + y( 7R - be t - -n e t ) 

where we have used the combined constraint of (4a) and (4b)), to obtain 
the two first order conditions, the usual one in 77 and then the second 
0 = dEt g ~l Lt respectively: 

7ft = 7r* - a(X - /?7 2 )[P2/t-i + a(n t - 7 t^) + e t ] + aXy* + + y (5) 

and 

y = a(X- f3'y 2 )pyt-i - aXy* - P^at (6) 

Substituting for the Lagrange multiplier, /x, from (6) into (5) and 
taking expectations at t — 1 yields (via the commitment constraint): 

Et-nrt = n* = n e t (7) 

Hence (5) now yields the optimal inflation rule under commitment: 


7Vt ^ ii 2 1 \ a v u x 

1 + a 2 (X - /? 7 2 ) 

To find the value function, one could equate (3) for lagged output 
with the RHS of (2) once the optimal values of 77 and yt are substituted 
in the RHS. But as we are only interested in y 2 in this case we can take 
the first derivative of each side of (2) with respect to y t ~ 1 and equate 
these expressions (the first derivative of two sides of an equality are 
equal via the ‘envelope theorem’). Differentiating the LHS of (2) yields 
7i +7 2 J/t-i- The RHS of (2), remembering that once optimised we take 
expectations of it, is differentiated as: 

p 2 ( x - Pi 2 )yt-i + p^y* + pP 7 i 

from which it follows, equating the LHS derivative with the RHS deriva¬ 
tive, that: 

_ P^V* 

71 1 + PP 


a 2 ( A-/?7 2 ) 


Hence: 


7T t = 7T - 


aX 

1 + a 2 X - 


Notice that the supply shock response in now larger because of the 
need to stabilise future output which will now be affected by the persis¬ 
tence term. 
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Discretion 

Now the problem is 

Max(wrt 7r t ){-0.5(7r t - n*) 2 - 0.5A(y t - y*) 2 + (3V ( y t )} (9) 

where now 7rf is simply the rational expectation of t-inllation formed at 
t - 1. 

Using the same value function, the first-order condition yields: 

'• = i + ^o-0t 2 ) K + “ 2(A " fh ^' t ~ “ (A ~ ' 37s) 

[pyt- 1 + e t ] + aA y* + Pa^} (10) 

Taking expectations yields: 

E^nt =7r* - a(A - @j 2 ) \py t -i] + aXy* + /foqq (11) 

Hence 

7T t = 7T* - a(A - Pl 2 )\pyt-i\ + aXy* + pa^ 1 - : + ^(A^/Uy. ) €t 

= a-be t - cy t -i (12) 

where a, b and c are implicitly defined. 

Now set dV Qy*^ = 7i +722/t-i (the derivative of the supposed value 
function wrt y t - 1) 

U(yt-i) = E t _i{-0.5(a - 7T* - be t - cy t - 1) 2 - 0.5A(y t - y*) 2 

+ /?[7o + 7i Vt + 0.572?/?]} (13) 

where y t = py t - i + [1 - ab\e t 
Hence 

dViyt-x) = c ^ a _ ^ + pAy * + p 0 7i _ { c 2 + p 2[ A _ /3 72 ]}y 4 _ 1 (14) 

°%-i 

Equating coefficients between this and the supposed value function 
derivative yields: 

7! = c(a - 7T*) + pXy* + p(3j 1 

and 

72 = -{c 2 +p 2 [A-/?7 2 ]} 
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For values where the model is well-behaved the lowest negative root 
of this quadratic is relevant (see Svensson, 1997) and we obtain the 
optimizing value of c (and 7 J). This can then be substituted into the 
expression for r ) 1 to yield: 

A y*(p + ac ) 

11 \ — (3{p + ac) 

The basic point is that there is an inflation bias = 1 _ l ^ v p+a ^ +cy t -\. 
Also the response to the current supply shock is excessive because the 
future inflation bias also depends on today’s output; so it needs to be 
stabilised more strongly. To eliminate these twin problems the mon¬ 
etary authority must be prevented from following a (lagged) feedback 
rule, since just as in the Sargent and Wallace ineffectiveness result such 
a feedback component is fully anticipated by wage/price setters and im¬ 
pounded into E t _iTr t . However, provided the feedback rule is off ‘current’ 
output, there is no bias and the optimal stabilization can be achieved; 
implictly the justification for a feedback off current output is that there 
are long-term contracts so that the authorities can react to events before 
the wage/price setters. Hence this set-up can be considered as a simpli¬ 
fied overlapping-contract Phillips Curve, where in effect the wage/price 
setters all contract simultaneously at the start of the period. 

Svensson (1997) considers ways this can also be achieved through 
Walsh contracts, Rogoff ‘twisting of preferences’, altering the inflation 
target and the output target. It is obvious that the Walsh contract 
must be state-contingent to eliminate the lagged output element in the 
inflation bias. The same is true of the output target; it must now be 
equal to the ‘short-run natural rate’, pyt.-i- 

However, altering the inflation target to be state-contingent, while 
it removes the inflation bias, does not restore the optimal response to 
the supply shock. Because it introduces a remainder term (the square 
of output appears now additionally because it enters the inflation tar¬ 
get) into the utility function, it leads to an over-strong response to the 
shock, over-stabilising output. In this case the ‘intriguing’ result occurs 
that if the authority is made more ‘weight-conservative’ together with 
a state-contingent inflation target, the conservativeness can offset this 
over-reaction, so that the optimal result is restored. 

The basic point remains that in practice the central bank must be 
induced to react only to current shocks and not to lagged information al¬ 
ready incorporated into people’s contract decisions. This does in practice 
then require a decision on the relevant length of the ‘current’ period; new 
information arriving within this period should be reacted to, previous 
not. On the logic of long-term contracts overturning the Sargent-Wallace 
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result (Minford and Peel, 2001), this should be the longest period for 
which nominal contracts are written. 



12 


Money in representative agent 
models 


What is money? This seems an odd question to ask. Clearly it is the 
physical item — dollar bill, beads, gold coins — used to pay for goods 
in order to avoid the ‘double coincidence of wants’ required in barter 
exchange. The classic theory of its emergence without social ‘invention’ 
is due to Menger (1892) who develops an evolutionary model. In this a 
good with appropriate characteristics (transportability, wide use etc.) is 
requested in exchange for goods by sellers secure in the knowledge that 
someone else will want it in exchange for the goods they may wish to buy 
— see White (1999). In fulfilling this function money will not pay interest 
because of the inconvenience involved; money holdings would have to 
be dated on their face to pay interest, like bearer bonds. But given 
the frequency with which money changes hands, for each bearer to get 
interest due of a few pennies would involve obviously bigger transactions 
costs. (Mrs. Grocer has to give you interest on your pound note when 
you pay for your bread, after allowing for when you acquired it — an 
unhappy lady.) 

Other theories of how it emerged are of interest but appear in the 
end not to dominate Menger’s insight. We discuss Wallace’s (1980) 
overlapping-generation, OLG, model below; in it money has a store- 
of-value role in the absence of bonds. As a store of value money has the 
serious problem that it does not pay explicit interest. Even in primi¬ 
tive societies without paper bonds there are ways of storing value that 
yield expected return — cattle, inventories, crop credit arrangements 
etc.. There is an analogous problem with money as a store of value in 
the Bewley-Townsend turnpike model. 

The search-theoretic models of Kiyotaki and Wright (1989, 1991, 
1993) are of great interest and essentially a contribution to Mengerian 
theory. They derive the optimal search strategy of a representative agent 
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within an environment where there are a variety of goods which may 
function as money, fiat currency being one but with no intrinsic value, 
the others (commodity moneys) having intrinsic value. They look for 
Nash equilibria with different moneys. Of course the problem with these 
equilibria is how one rather than another is selected. Menger’s evolu¬ 
tionary model implicitly assumes in addition some sort of adaptiveness; 
people ‘gravitate’ towards the initially most promising commodity money 
and later as the attractions of fiat money become evident (or the inter¬ 
ests of a monopoly currency provider, more likely, become dominant) 
they ‘jump’ to a fiat currency equilibrium. In their 1993 paper Kiyotaki 
and Wright show formally that fiat currency adoption is on a knife-edge; 
if most people decide to adopt it everyone will, but if few think of doing 
so, noone will. Selgin (1997) shows that a spontaneous move to hat cur¬ 
rency is unlikely if there is adaptive learning; rather, people are likely 
to adopt commodity money first as the closest alternative to barter and 
then stay with it. These models can be thought of as the beginnings of 
a formalization of Menger’s ideas. 

Suppose we accept the evolutionary theory of money’s emergence. 
Then by the same evolutionary principles we would expect money’s func¬ 
tions in time to be performed by new means. Means of payment emerge 
that use money as a unit of account but for most of the time avoid using 
money as a means of payment — clearing systems, bank accounts, de¬ 
posit accounts with nonbank intermediaries, e-money vouchers and so 
on. The original physical money is the ‘base’ of this system of ‘credit 
payment’ (so-called because you pay with an interest-bearing balance) 
besides defining the unit (see White (1999) again for a compact history 
of intermediation along these lines). 

As this evolution proceeds we should observe that competition be¬ 
tween intermediaries drives the rewards to these functions of money down 
to the cost of provision for each. So effecting payment via a clearing sys¬ 
tem will command a competitive fee, while holding a deposit will com¬ 
mand the rate of interest: there is ‘unbundling’ whereby we can consider 
deposits as held for their returns as stores of value and payment as a sep¬ 
arate service done for a fee. Cash alone, being provided by a monopoly 
supplier (the government typically), will not be subject to competition 
by assumption within its ‘domain’; however, even here evolution may be 
producing competition as it becomes easier (via the internet for example) 
to use alternative units of account with general acceptability. 

Hence to introduce ‘money’ into representative agent models we must 
ask what our purpose is: to model a particular earlier stage of monetary 
evolution or to attempt to model ‘modern times’ (or indeed the possi¬ 
ble future)? In what follows we introduce several examples of money 
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in the economy, starting with models which we view as historical in in¬ 
tention and going on to models of the here-and-now. Finally we discuss 
where some tendencies in monetary policy and in potential competition 
between monetary authorities might lead in macroeconomic behaviour. 

We begin with models which treat money as a store of value (it may 
also be a means of payment but so is any other asset or any good). 
Representative agent models face a fundamental difficulty in this case: 
money, defined as a non-interest bearing asset, has no role to play unless 
one is created for it by an artificial constraint. Consider the models of 
chapter 11. In models like Lucas’, with infinitely-lived agents, there is 
nothing to stop trades in fruit (different sorts presumably) being paid for 
by an interest-bearing claim on fruit-in-general. In the Bewley-Townsend 
model, with parallel communities, government bonds can be used in 
payment. In the OLG models, again government bonds can be used. So 
money is inferior to interest-bearing claims and will have no value. 

Accordingly, these models have a problem. To resolve it, the Bewley- 
Townsend model assumes away government bonds, leaving money as 
the only asset (possibly a government liability). We are presumably to 
assume that this is a very primitive economy in which there is no legal 
or other infrastructure to support lending and borrowing. 

OLG models attempt an optimizing-agent explanation in the pres¬ 
ence of a full menu of assets. They accept that money must offer a yield 
equal to that of bonds, or it cannot be held (will become valueless, with 
the ‘price level’ in terms of it becoming infinite). Then equilibria must 
involve a rate of inflation (or equivalent money creation and distribution 
to existing holders) equal to the rate of interest. 

It is not obvious what application these models have. The OLG 
assumption (with no bequests) breaks up inter-generational links and 
opens up a role for government as an inter-generational intermediary, as 
we saw in chapter 11 with government bonds. It is frankly difficult to see 
why a government would print money in such a world when bonds can be 
issued. (A competitive banking system paying interest on money would 
be a different matter.) However, a number of these models have grafted 
on to the OLG constraint additional constraints: that only money (no 
bond) exists, and that there are regulatory requirements to use it (e.g. to 
pay taxes). (Incidentally, this regulatory theory of money, presumably 
motivated by the government’s desire for revenue, is the nearest any of 
these models gets to an optimizing theory of why money exists.) These 
constraints place such OLG models alongside the Bewley-Townsend one 
as examinations of economies at a rather early stage of monetary evolu¬ 
tion. 
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INTRODUCING MONEY INTO BEWLEY-TOWNSE- 
ND AND OLG MODELS 

In chapter 11 we showed that both the Bewley-Townsend and OLG 
models gave government bonds a role in intermediating between groups 
which would not lend to each other. However, government bonds are a 
relatively sophisticated instrument, coming late in economic evolution. 
Money came first, as a government-backed medium of exchange and store 
of value. It is interesting to ask how these economies would behave if 
there were only money to perform this intermediary function. 

If money is the only available store of value, then to achieve the 
optimal consumption-smoothing (that is, intermediation) that bonds 
achieved (in chapter 11) we require prices to be falling at the rate of 
time preference. 

Consider now the Bewley-Townsend model. Consumers with T- 
period lives maximize 

T 

J 0u{ct) - n't ( p t ct + m h t - p t y t - rrit-i) (!) 

i=0 

with rn A 1 = 0, m B 1 = m given; A agents receive y A = e (odd periods), 
y — e (even); B agents receive y B = y — e (odd), e (even). The first-order 
conditions are: 

°= = / 3'u'{ct ) - h t p t ( 2 ) 

OCt 


SJ 

6 mt 


-fJ>t + Mt+i = 0 for m h t > 0 


< 0 for mj* = 0 (3) 


0 = 


SJ 

SrrJl 


= — p!f = 0 for > 0 


< 0 for = 0 (4) 

The last, terminal, condition implies that for money to be held in 
T (as it must be for market clearing, i.e. Navi a + NbWI B = M), then 
—/ 3 t u'(ct)/pt = 0. But with /3 t u'(ct ) > 0 (since ct < oo, there can be 
no saturation of wants), pr must be infinite: money will all be spent in 
the last period until it is worthless. Working back to T — 1, since = 0, 
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so will = 0, and money will become worthless throughout. So to 
have a monetary economy we have to let T —> oo . Agents live forever 
(a parable for the household which treats the interests of its descendants 
as its own, subject to the constant rate of time preference, /?). 

In this case, the terminal condition becomes for > 0, 

UmT^oo(-^T) = 0 


If prices are falling at the rate 1 — (3, we know that all agents will 
smooth their consumption optimally (as in chapter 11); consumption 
will be constant. Whichever agents are holding money at time t have 

u'{c t ) = (hi\c t+ 1 ) 

Pt Pt +i ° 


But A and B agents hold money in alternate periods (as we saw with 
bonds in chapter 11). Hence (5) holds in alternate periods for A and 
B agents: so it always holds for one set of agents. Therefore since c t 
and u' are constant, ^ Unfortunately, this violates the terminal 

condition, since and therefore limT->oo(—P t) = ~ U pg t ' > ^ 0 

(unless po = oo). 

What this means is that as time goes on the present value of money 
holdings (discounted by the rate of time preference) does not diminish: so 
people find they have surplus money and spend it. This raises pr and the 
path before it, and p does not fall by (1 — /?) each period. This solution is 
therefore not an equilibrium. It follows that the Pareto-optimal solution 
in this model is impossible if only money, not bonds, exists. 

Only equilibria where prices fall more slowly than (1 —(3) are possible. 
One such equilibrium is a constant price one: this is obviously interesting 
as one of the features encouraging the evolution of money would be its 
stability in purchasing power. Figure 12.1 illustrates how price stability 
(offering a zero rate of return) achieves less than perfect consumption 
smoothing. 

Let p be constant at p = 1. Then A agents for example who get e in 
odd periods will consume c** in odd periods, c* in even (c** -< c*), so 
that 


u'{c * *) 
u'(c*) 


1/(3 


( 6 ) 


B agents consume c** in even periods, c* in odd (with (6) holding as for 
A agents), c** + c* = y for both sets of agents, so that c* = y — c**. 
Since there are the same numbers (N = Na = Nb ) of each, this also 
satisfies market clearing. 
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A agents move to an internal equilibrium at c, given zero 
interest rate: lend' a by acquiring money. B agents fail to reach 
an internal optimum. 


Figure 12.1: Monetary equilibrium with a static price level in the Bewley- 
Townsend model 


B agents will hold enough money at the end of odd periods to buy 
c**in the even periods, but will run this money balances down totally in 
the even period. The reason is that in the even period u'{c**) > f3u'(c*), 
so that they would rather spend then than wait and will exhaust their 
money holdings in the attempt: in terms of the first order conditions, 
they are at a corner solution with mj 1 = 0 (f even). A agents follow 
the same pattern in the alternate period. The result is that the money 
supply is passed from A to B each period. 

It is possible to get to Pareto-optimal equilibrium if some way can 
be found to pay interest on money while still satisfying the terminal 
condition. For example, if prices fall at 1 — (3 per period and the money 
supply is reduced at this rate also by lump sum taxes, then this condition 
will be satisfied. But it is of course a highly artificial scenario, hardly 
suited to the idea that this is a primitive economy. 
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MONEY IN THE OLG MODEL 

In the OLG example of chapter 11, we found that, if the government 
borrowed to finance a transfer to the initial old generation, and chose this 
transfer well, it could perfectly smooth each generation’s consumption 
pattern across its youth and old age. By failing ever to pay off the loan 
it maintains this perfect smoothing for ever. The real interest rate is 
zero; because the economy is not growing, this is in this example the 
Pareto-optimal outcome. In this model if the economy were growing at 
the rate n, the Pareto-optimum would be where r = n. 

This is illustrated in figure 12.2, which shows the zero-saving equi¬ 
librium (consumption is y — e in youth and e in old age) with r = r 0 and 
the feasible reallocation along the line —(1 + n), achieved when r = n. 
This reallocation is feasible because the next generation is (1 + n) times 
the current; so if 6 consumption is transferred by each of the young to 
the old each period, then each of the old, being less in number, will enjoy 
5(1 + n). By giving up consumption to the old today then the young 
guarantee an old age in which they will enjoy 5(1 + n). (To achieve this 
equilibrium, the government must increase its debt and spending each 
period at the rate n). 



Figure 12.2: Feasible reallocation of consumption from zero savings (autarchy) 
to optimal intergenerational lending in government bonds — the overlapping 
generations model with growth, n. 

In the example of chapter 11, where n = 0, and bonds paid zero 
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interest, we could easily think of money performing exactly the same 
function as bonds. Let the government issue currency, H(t), instead of 
bonds and let the price level be p(t). Then we can treat = —L 9 (t) 
as equivalent to the previous bonds, and the equilibrium would be the 
same. If currency issue were to be kept constant at Hq, pit) would then 
be constant at po: = 1, implying a zero interest rate on money, 

the Pareto-optimal equilibrium as before. 

So in this case, money could perform the same consumption-smooth¬ 
ing function as bonds, justifying its role in early societies with no growth 
and an unsophisticated financial market. It would also be possible for 
any government to swap bonds for money with no effect on anything 
(prices, consumption, interest rates) in the economy: there is no need 
for the new mix of bonds and money H'(t) + L 9 '(t) to change p(t) since 
H p( ^ +L 9 '(t) = y^- + L 9 (t) and, given that government liabiliies are the 
same overall, nothing else changes either. This is the ‘real bills’ doctrine, 
that open market operations altering the supply of money have no effect. 

Matters become more complicated in growing economies, where the 
optimal r = n > 0. Here it will often be the case that a monetary equi¬ 
librium cannot be optimal. Essentially, this is because in a monetary 
equilibrium prices would have to be falling at the rate n. Yet the gov¬ 
ernment must run a deficit in order to permit intermediation between 
generations. To finance this it must issue more and more currency, which 
will be inconsistent with falling prices. 

However, there will usually exist suboptimal monetary equilibria. A 
large literature exists exploring the properties of such equilibria: for an 
introduction to it see Sargent (1987). 

The main difficulty with using OLG models for the study of monetary 
economies is their emphasis on money as a store of value: this and the 
arbitrage between money and other financial assets produces somewhat 
strange results. For primitive static economies without alternatives to 
money, however, OLG monetary models offer interesting insights. 


THE CASH-IN-ADVANCE MODEL 

To resolve the problem of money’s value in a model of a modern economy, 
it is simply assumed that money is needed for transactions. Money here 
is the physical unit of cash; all other assets and liabilities are bonds, 
physical assets, or claims to them (e.g. equities). It is assumed that 
to make transactions cash is required. In one such model, Lucas’, there 
is a ‘cash-in-advance’ constraint (following Clower, 1965), whereby it is 
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assumed that spending can only be carried out with money, which must 
therefore be accumulated in advance (Lucas, 1980). 

Similar to this, and used in other models of the same general type, 
is the assumption that money ‘has utility’ and is an argument of the 
consumer’s utility function. Effectively, this is being justified by the 
Glower constraint, and is merely an alternative way of expressing it, 
implying a degree of substitutability of money in transactions, whereas 
the cash-in-advance model assumes none. 

If money has no use, it becomes valueless. This is obvious enough. 
Consider some cowrie shells which are stated by some village chieftain to 
be currency: if no one needs them in order to exchange goods (and they 
are useless for any other purpose), then there are two possibilities. One is 
that they have a value in exchange: but if so, they will be spent in order 
to obtain that value from the goods for which they can be exchanged. 
Since everyone will get rid of them in this way, they fall in value until 
they are worthless — the other possibility. 

This is worth briefly demonstrating. Suppose the chieftain (govern¬ 
ment) issues currency of M per capita and will spend ^ = g 0 where pt = 
the price level, g 0 = government spending in period 0: the government 
spends no more, gt = 0 (t > 1). If there exist other assets with a gross 
return R t , then for currency to be held it must have an equal return so 
that 

_P^ = qt+i+dt^ 


where qt is the price of the assets (trees) and dt their dividend (fruit). 
Now consider the consumer’s budget constraint when the market equi¬ 
librium holds: 

q t s -t—- + d t = c t + qtS -\ -— (t > 1) (8) 

Pt Pt 


where s = the number of trees per capita and nit+ 1 = the number of 
currency units held for period t + 1. 

Since Ct = d t by goods market equilibrium, it follows that ^ 1 ; 

that is, is never spent, but it grows at the rate ( R t — 1) each period, 
so that its present discounted value is ^ of course. 

It follows that if there is market equilibrium the consumer can only 
spend d t in every period; and yet if ^ > 0, he will fail to spend all his 
wealth over his infinite lifetime; that is, he will spend all except (in 
present value). This is sub-optimal. The consumer will therefore attempt 
to spend more than dt, which will drive the price level to infinity. Since 
1/pi = 0, money will not be held in period 0, so that l/po = 0, and the 
government will be unable to spend anything (go =0). 
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In order to give money value, it must be given a source of useful¬ 
ness. As we have seen, one way to point to this is through the Clower 
constraint, revived by Lucas (1980). 

In Lucas’ model each household has to acquire money before going 
to buy goods for consumption. Then while one member is shopping for 
consumption, the other householder is selling the household product for 
money, which is taken into the next period. Then that part of it not 
needed for shopping is exchanged for income-yielding assets; and the 
whole sequence is repeated, starting again with shopping/selling later in 
that period. 

Begin at the start of a period, where the houshold has money, and 
assets: one tree, s t _i = 1 whose price is r t (x t ), and government nominal 
debt lf where x t is the state of the economy (the vector of current 
endogenous and exogenous variables). All quantities are measured per 
capita: everyone now goes into a securities-trading session where assets 
are acquired and taxes are paid. The market-clearing conditions for this 
are: 


m p -I- mf = M t+ i (9) 

a t = 1 (10) 

lt+i( x t+i) = k+i(xt+i) (11) 

where M t+1 and l t+ \(x t +\) are respectively the money supply and gov¬ 
ernment claims issued by the government and mF t , If , (ay+i) the private 
sector’s demand for these; m 9 is the government’s demand for money. 

Having acquired money, government and private agents go into a 
goods trading session in which household products are sold for money 
to government and household consumers: 

Ptdt = m\ + m 9 t (12) 

is the household income carried into the next period in the form of a 
money holding. d t is the harvest per tree. 

Pt9t = rn 9 t ( 13 ) 

p t c t = m p t (14) 

are the government’s and household’s consumption respectively. (We 
will assume that the nominal rate of interest is positive so dominating 
the zero return on money — so that consumers never hold more money 
than they plan to spend. Hence (14) is an equality.) 
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Equations (12)-(14) imply that the goods market clears: 

Ptc t + Pt9t = Pt.dt (15) 

and money-market clearing (9) then implies the Quantity Theory: 

M t+ 1 = pt.dt (16) 


Now consider the household’s consumption decision in this frame¬ 
work. 

OO 

It maximizes E t =o X) (d' u { c t) subject to 
t =o 


6t{xt) 


m p 1 

> -d- + Tt + r t (xt)st H- 

Pt Pt 

= Pt<k 


lt+i(xt+i)n(x t +i, x t )dx t +i 


a t \ Ptdt.St , \ > l t+i( x t+i) m p t -p t c t , . 

6 t+ i{x t +i) = - h rt+i{xt+i)s t + — -1-(17) 

Pt+i Pt +1 Pt+i 

Equation (17) states that the household’s beginning period real wea¬ 
lth, 6 t (xt), must be spent on money, taxes (r t ), and government claims 
(at current price n(xt + 1 , x t ) for contingency xt+ 1 ); that consumption 
can only be carried out by money; and that next period’s beginning 
wealth will be produced by this period’s income (in the form of money 
=Ptd t s t ) deflated by next period’s prices, next period’s value of the trees 
and government claims acquired this period, and the value of any money 
acquired by the household before the goods trading session but not used 
for consumption then. However our assumption that the nominal return 
on bonds is positive implies that there will be no such surplus money 
balance, because money is only useful for buying consumption goods, 
and the cash-in-advance constraint is binding: rn p = p t Ct- 
The consumer’s Lagrangean is then: 


OO 

L = E t = o Y, ftujct) - p t {(ct +T t + rt(x t )s t (18) 

t =o 

+ ~ J l p t+i(x t+ i)n(x t+1 , x t )dx t+ i - dt - 1 ^ lf ’ t ~ 1 ( 19 ) 

-r(x t )8 t -i - (20) 

Pt 

The first order conditions yield: 


(3u'(d t +i — gt+i)M t+ idt + i . 


(21) 



Money in Representative Agent Models 


317 


r t (x t ) 


[ f pu'(d t+1 — fft+i) \ (00 , 

J I u’idt-gt) /' 

jr t+ i(x t+ i) + 1 /(zt+i, x t )da; t+ i (23) 


Equation (19) is the price of a nominal bond which pays out when 
x t+ i occurs and it is set just as it was earlier, as the expected domes¬ 
tic marginal utility of consumption in that event relative to its current 
marginal utility; except that in this case it is the marginal utility of con¬ 
sumption of a nominal rather than a real unit that is assessed, so that it 
is deflated by the price level when x t +i occurs. (Of course Ct = d t — gt. 
by the market-clearing condition.) 

Equation (20) is the price of a real asset (the tree). Again the pricing 
method is the same as before but now the asset’s dividend is in mone¬ 
tary form ( pt.dt ) as it can only be exchanged for money. So apart from 
the expected future real price, J r t+ i(xt+i)f(x t+ i, x t )dx t+ 1 , its value 
depends on the expected value of the dividend received this period but 
spent next, f ^f(x t+1 , x t )dx t+1 . 

What effects does government policy have on this economy? 

The government’s budget constraint is: 


i 1 / , ( ^ ( n,j h{xt) M t+1 —M t 

g t = r t + — / lt + i{xt + i)n(xt +1 ,xt)dx t+ i ---1--- (24) 

/g j Pi 

Government policy consists of choosing sequences for g t , r t and M t , 
conditional on Xt : and consistently with (21). Now because in this econ¬ 
omy the output must be shared between government and private con¬ 
sumption, it is impossible for consumption to be affected by the pat¬ 
tern of taxation or money supply, given the government consumption 
sequence. However, the money supply sequence (and so also the tax¬ 
ation sequence if it is altered as a result) does affect the real value of 
real assets (trees). Hence there are real effects of monetary policy and 
taxation, because the returns from real assets can only be enjoyed by 
being exchanged for money and then spent. It follows that if there was 
investment in this economy, but the returns from investment could only 
be enjoyed by exchange for money, then private consumption would be 
affected by taxation and monetary policy: the mix between private in¬ 
vestment and consumption would be changed. Only if there were an 
investment vehicle (such as an indexed bond) offering consumption pos¬ 
sibilities quite independent of the price level would this cease to be the 
case. But such a vehicle may be ruled out in this cash-in-advance world 
(even on an indexed bond the indexed dividend has to be paid at a 
particular time to be exchanged later for goods in monetary exchange). 
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UNPLEASANT MONETARIST ARITHMETIC REVIS¬ 
ITED 

Let us return to the question addressed in chapter 7: whether tighter 
money today will reduce inflation permanently in the absence of changes 
in government spending and tax sequences. We can rewrite the govern¬ 
ment budget constraint: 

K = limit! + W g (a; , +i ,x.) (M<wl ~ Mw) rix, +j ( 25 ) 

Po ‘—{J Pt+J 

where K is the present value of the (g t — r t ) sequence and q(x t +j, x t ) = 
13 "£) +j) /(^+j. Xt ) the P resen t (t-period) real discounted value of 
a unit of consumption if x t .+j occurs (that is, it is a contingent real 
discount rate). 

Using the quantity theory, we can rewrite this: 

K = d 0 - — + Y] I <l( x t+j,x t )(d t+j - pt+j^dt+j^/pt+^dxt+j 
Po J 

( 26 ) 

If Mi is lowered, po falls (Mq is given), so ^ rises and there is a lower 
current inflation tax which must be offset by a higher future inflation tax 
(pt+j/pt+j- 1 ): but because q(x t +j, x t ) is of the order [3 J {ct+j being 
random and stationary), future inflation has to be higher by the order 

/ r j . 


POSSIBLE MODERN EVOLUTIONS OF MONEY 
AND MONETARY POLICY: THE REEMERGENCE 
OF COMMODITY MONEY? 

In chapter 6 we considered the evidence on optimal monetary (interest 
rate) feedback policy. We saw that rules in which monetary conditions 
reacted to deviations of inflation and output from their respective targets 
(of ‘low’ inflation and the natural rate) gave good results in conditions 
where people signed overlapping wage (or price) contracts of some given 
structure. However, the question arises whether these rules remain opti¬ 
mal when this structure is endogenous; we suggested there that this was 
not so but rather one found that rules offering price stability (that is, 
where any excess over the inflation target — which need not be zero — 
is clawed back in subsequent periods) produced a gain in macro stability 
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because people lengthened the period for which nominal contracts would 
be signed. A formal rule of this sort would be one with a price-level tar¬ 
get in which there was, for example, full clawback of a quarterly inflation 
error in the following quarter. This is examined in Minford et al. (2001) 
where the details can be found. The rule is analogous to a commodity 
standard in which if prices rise, implying that the relative price of the 
commodity, say gold, has fallen, then less gold is produced, driving the 
price up again and the general price level down again. With fiat money 
being targeted on the general price level in terms of money, the sup¬ 
ply of money would react more rapidly than gold production under the 
gold standard; for example, suppose that productivity rose, raising the 
supply of goods and so driving down the price level, then the money sup¬ 
ply would rise in the following period to accommodate this productivity 
increase, pushing prices back up again. 

If households are using their wage contract structure to help minimize 
the variance of their consumption pattern, then they should react to 
this greater certainty in prices by reducing the indexation element (or 
equivalently lengthening their contract period). This will mean (see 
figure 12.3) that the aggregate supply curve will be flatter (more nominal 
rigidity) and the aggregate demand curve will be steeper (less reaction 
of money demand to rising prices because less effect on wages; and so 
less monetary tightening caused by rising prices, so less fall in output 
and employment). This in turn implies that real shocks have less effect 
on both output and prices. So provided pure monetary shocks are kept 
low this is a recipe for general macro stability. 


CONCLUSIONS 

Money is a commodity used as a unit of account and a means of payment. 
Its evolutionary origin most likely lies in the emergence of a commodity 
with the right characteristics to avoid the double coincidence of wants 
implied by barter. Its evolution proceeded towards the modern com¬ 
petitive banking system in which money remains the unit of account, 
issued by a monopoly government, but its payments function has largely 
been taken over by clearing systems based on credit, in which people 
can earn interest on their assets as well. Its evolution continues today, 
with the monopoly role of government in deciding the unit of account 
possibly coming under competitive threat from other governments and 
even perhaps the private sector. The models we focus on in this chap¬ 
ter are intended to understand how money interacts with the economy 
at different stages of monetary evolution. The Bewley-Townsend model 
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AS (High Indexation) 



Figure 12.3: The effect of reduced indexation on slopes of AS and AD curves 
[<j> t = productivity shock; mt = monetary shock] 


explains how money permits two parallel (primitive) communities which 
are too remote from each other to lend directly to each other, to lend in¬ 
directly via money holding in the absence of interest-bearing instruments 
issued by a government or some trusted international intermediary. The 
OLG model shows how money can be a store of value which permits one 
generation to lend to the next indirectly again in the absence of bonds: 
however in a growing economy there are difficulties in achieving a mone¬ 
tary equilibrium which is Pareto optimal. The last model we looked at, 
Lucas’ model of cash-in-advance, simply assumes money is needed for 
transactions; given that, it models how an economy with its full panoply 
of other financial instruments would behave. Because returns on other 
assets have to be turned into money before they can be enjoyed, changes 
in monetary policy can alter the expected rate of return on these and 
so have real effects. Finally, we considered how money and monetary 
policy might develop; currently we observe overlapping wage (and price) 
contracts which make it optimal to pursue rules of monetary reaction to 
deviations from inflation and output targets. However, if contracts are 
endogenous, as surely they are over some time period, then it may well 
be desirable to pursue price-level stability; this lengthens contracts and 
creates greater automatic stability in the economy. 



13 


Open economy representative agent 
models 


In chapter 10 we explained open economy macroeconomics within a stan¬ 
dard macro model of the aggregate demand/supply sort. This model is 
widely used by macroeconomists for policy and forecasting. Neverthe¬ 
less it has no explicit micro-foundations and economists have wished to 
analyse open economy issues in the same way as in the models of the 
last two chapters. In this chapter we extend some of those models to 
the open economy. There is a huge degree of complexity possible in 
such models as a wide array of features special to the open economy can 
be incorporated (for a fall account see Obstfeld and Rogoff, 1996). We 
limit ourselves rather strictly here and aim to get the reader familiar 
with some essential elements. In particular we will invariably assume a 
small open economy facing world markets (one can then easily think of 
that world economy being effectively a closed economy as in the last two 
chapters, uninfluenced by its small neighbour). 


THE OPEN ECONOMY IN THE CASHLESS OLG MO¬ 
DEL 

A natural way to begin thinking about the open economy is in terms 
of overlapping generations trying to save for their old age. Whereas in 
chapter 11 this was only possible if the government stepped in with its 
borrowing, with the open economy it is possible without a government, 
through the balance of payments. Let there be one sort of perishable 
good traded throughout the world. We adopt the OLG model of chapter 
11 but now allow lending overseas by the young generation, l f t l . which 
is then repaid to it when old — as before the old must consume all the 
resources they have. 
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The household maximizes the logarithmic utility function 

+ !)) = ln + (1) 

subject to y — e — Cf(t) = and e + [1 + r(t)]Z* = c^{t + 1), where 
r(t) is the world net rate of interest, exogenous to this economy. Its 
Lagrangean is therefore: 


J = In c’t ( t ) + Incj *(t + 1) + $ (t) [y-e- c h t (t) - %] 

+ + 1 ){ £ + [(1 + - c t(t + 1 )} ( 2 ) 


The first-order conditions yield: 

h _ c h t {t + l) 

Ct[t) ~ 1 +r(t) 

The consumer’s life-time constraint is: 

h . . c! l (t +1) , £ 

Ct{t)+ TTm ={y ~ e)+r nw) 


so that 


and 


h , . y — e e 

ct(t) = ^r- + 


2 2(1 +r(t)) 


l h = y -£ e/2 


2 1 + r(i) 


( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 


We now consider market clearing in the open economy when there is 
no money, only credit notes denominated in terms of the single good. Our 
small economy residents can make a loan to foreigners, allowing them 
to consume home endowments in return for repayment with interest 
next period; any repaid loans’ proceeds will be used on consumption of 
foreigners’ endowments. It is plain that any net loans made will be equal 
to net home endowments not consumed at home, that is, net exports: 
we assume in the standard small-economy way that the net loan and net 
export supplies by our residents make no difference to the world price of 
loans (the interest rate) or of the single good. So market clearing for our 
small economy is automatic at world prices via the balance of payments 
which gives: 

It ~ It-iO-+ r(t - 1)) = y — $(t) - c/_!(t) (7) 


One can imagine these external transactions taking place on the 
beach where foreigners come to trade; all sales by young residents are 
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settled by credit notes and all credit notes held by old residents are 
repaid with interest by supplies of goods. 

We can now consider the behaviour of our small country’s balance of 
payments on the assumption that the world interest rate is constant at 
r. With each generation’s endowments in youth and old age constant, 
each young generation will consume and lend the same as the next: 
hence l = l£_ 1 = l h . Also c^{t) = c h and c/(t + 1) = c h (l + r) are 
constant.Since the population of each generation is assumed in (7) to be 
constant at N, it follows from (7) that the current account of the balance 
of payments, 

y - c h - c h { 1 + r) + rl h = l h t - l h t _ x = 0 (8) 

is in continuous balance. 

However this would plainly alter if there was growth, say in pop¬ 
ulation N at the rate n, still assuming constant r. In this case each 
generation has a constant per capita consumption when young and old, 
and so also constant borrowing when young — the same as in equation 
(8). But each generation is (1 +n) times the previous one in size.Then 
the balance of payments equation would become: 

(1 + n)l h - l h (l + r) = (1 + n){y - e - c h ) + e-c h ( 1 + r) (9) 

What this shows is that this country will have a persistent current 
account balance of payments surplus of, using (6): 

+ ‘ 10 > 

This is because the young are always more in number than the old so 
that their aggregate savings exceed the aggregate dissaving of the old 
(being only equal when their numbers are equal). It follows by the same 
argument that if population were declining then the country would be 
in persistent deficit. 

What we see is that the focus of this model is on the dynamics of 
population and the endowment. The balance of payments is an auto¬ 
matic financing mechanism about which there is no particular concern; 
its surpluses and deficits will reflect these underlying dynamic factors. 

Suppose we return to the constant population/endowment model and 
examine whether the government could create problems for the balance 
of payments. The government’s budget constraint is: 

G(t) + L 9 (t) = Yl T tS) +X]r* t ( t ) + [ 1 + r(t-~l)\L 9 (t-l) (11) 

where (f) an d are the f-period lump-sum taxes raised 

respectively on the t — 1 and t generations. 
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The household’s budget constraint now must deduct the taxes pay¬ 
able from its income in each period: 


and 


c h t {t) 


y~£~T%(t) E-T^t+l) 
2 2(1 + r(t)) 


r = 


y-e- T?(t) _ e-T%(t + 1 ) 

2 2(1 + r(t)) 


Now the balance of payments becomes: 


( 12 ) 


(13) 


LP(t) + l* - Iti ~ L 9 (t— 1) = y- c£(t) - G(t) - + 

+ L°(t- 1)) (14) 


with the capital account on the left-hand side of the equal sign and the 
current account on the right. From this we can see that if the taxes 
collected on the young and on the old each remain constant, like en¬ 
dowments, population and interest rates, then households will lend a 
constant amount (Z^_ 1 = l t l ), and the capital and current accounts will 
be in balance if the government balances its budget (Lf = L 9 (t — 1)). 
The difference then from the closed economy OLG model is that now the 
government has no beneficial role in borrowing because the private sec¬ 
tor can smooth its consumption without its assistance. Nevertheless of 
course if the government borrows and so creates a current account deficit, 
it means it must repay it by reduced spending or extra taxes later; there 
is no ‘balance of payments problem’ unless there is a possibility that 
it will not. But this in turn would be spotted by foreign lenders offer¬ 
ing the loans so that what one sees here is a process of smoothing of 
household and government spending undertaken by foreign lenders in 
an entirely voluntary, self-enforcing process. It is not difficult to embed 
temporary or permanent shocks into this model to see the smoothing 
effect via balance of payments deficits and surpluses. 


AN OPEN CASH-IN-ADVANCE MODEL WITH DY¬ 
NASTIC HOUSEHOLD 

The OLG model focuses on longer-term savings decisions and so high¬ 
lights the role of the balance of payments in smoothing the effects of 
long-lived shocks to population and GDP. For short-term analysis of the 
business cycle however it is not much help. For this we naturally turn 
to one of the models of chapter 12, Lucas’ casli-in-advance model, where 
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the household is assumed to be infinitely-lived (that is, to be ‘dynastic’ 
in that it cares about the welfare of its descendants). We retain the basic 
set-up that households, government and foreigners decide in the first half 
of each period on all financial transactions, including the acquisition of 
money to buy goods in the home economy and now foreign money to buy 
foreign goods. We now add a second, foreign, ‘country’ — the rest of the 
world. The home and foreign goods are not, as in our OLG model, the 
same; they compete for the custom of households in the two countries. 

Each country has a stock of trees, one per capita. We will allow these 
trees to be owned by foreigners (capital movements), in fractions, so that 
in effect ‘equity’ can be held in someone else’s tree — one purchases a 
fraction of it. This is of no consequence in a closed economy where 
everyone is identical and so ends up with the same tree; however in the 
open economy people can buy parts of trees in other countries and as 
each country has different conditions this may well happen. 

The representative household in the home country maximizes in pe¬ 
riod 0 for example a logarithmic utility function in consumption of home 
and foreign goods and of leisure: 

OO 

Uo = E 0 (Inct + In c' t + aln l t ) (15) 

t=o 

subject to 

Ptc't Pt r t / 

PtCt H- \~ Ptrt St H- s t — pt-iyt-i + ptTtSt— 1 + 

&t &t 

[~ e "f +Pt- i(l - Ci)X-i} 4-i +Tt (16) 

yt = (l - ItYdtSt (17) 

m t > pt.c t (18) 

m't > Pt c ' t (19) 

where an asterisk denotes ‘foreign’ and a prime denotes the demand by a 
home resident for something foreign; c is consumption (hence d is home 
consumption of the foreign good); l is leisure; m is demand for home 
money (m! of foreign money); s is the demand for trees and r is their 
real price; p is prices; e the exchange rate (foreign currency per unit 
of home currency); y is output; d is the fruit crop of the tree when the 
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household is working at full stretch (l = 0); T is the money transfer from 
the government. 

We use the same sequencing as in chapter 12; the household has to 
use the proceeds of income in the last period (plus its government trans¬ 
fer and asset disposals) to acquire goods this period; income depends 
on work, with elasticity n. The consumer makes plans to hold money 
and assets during the financial sub-period. We assume the leisure plan 
is then carried out during the production-and-shopping subperiod that 
follows: shocks to the fruit yield are revealed in this subperiod so that 
consumption plans may be frustrated by price changes. 

Assuming as usual that money’s zero nominal return is dominated 
by the expected nominal return on trees, so that the cash-in-advance 
constraint is binding, the first-order conditions are: 


c* = Wh 
<h Pt 


( 20 ) 


[p t ( 1 - l t yd t +p t+ ir t + 1 ] 
tit - 

npt 


r Ct+lPt+l 


( 21 ) 


a (3Trptdt(l - kY 1 s t 

Y — ti t - 

H Pt+lCt+1 


( 22 ) 


Et 


f Pt+i i 


t+i 


+ pl(l-l*tYd* t 
Pt r t 


e t = E f Pt+in+i +p t ( l - itYY \ 
et +1 * 1 Ptti J 

(23) 


the first is the home/foreign goods consumption trade-off; the second 
that of present versus future consumption; the third that between leisure 
and future consumption; the last is that between home and foreign trees 
(uncovered interest parity in nominal terms). 

The foreign household has exactly analogous utility and first-order 
conditions. 

The government’s budget constraint is: 


M t - M t _i = T t (24) 

Market clearing gives us: 

s t = 1 = s* t (25) 
M t = m t + vn*t = p t (c t + c* t ') = p t d t (26) 

M t * = m* t + m't = pi {cl +c't)= p* t dl (27) 
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m*t' + ptTtASf' — pt- i(l — lt-i) n — —- H—— As' t — 

e-t e t 

Pt- l(l ~ /2g\ 

e* 

This last is the balance of payments constraint, that foreigners’ de¬ 
mand for home money (home exports) and extra home trees minus their 
earnings from their previous stock of home trees be equal in terms of the 
same currency to home households’ demand for foreign money (home 
imports) and extra foreign trees minus their earnings from their previ¬ 
ous stock of foreign trees. It shows that there is scope for one economy 
to smooth its consumption for example in the face of a poor crop by 
selling shares of its trees to foreigners; buying them back later in a good 
year. This model is in fact rather similar to that of chapter 10, even 
though the latter had no micro-foundations. It is more complex to solve 
both because of its non-linearity and because it is a two-country case; to 
solve it analytically would require either linearising or loglinearising it. 
(A simpler version without capital flows is solved analytically with some 
loglinear approximation in Minford, 1995). However our main purpose 
here is to show its structure. 


A SMALL-COUNTRY VERSION 

Now let us simplify this to make it possible to derive a tractable analytic 
solution. First, let us treat leisure as a constant; then let both fruit yield 
and the money supply be random variables around a constant mean, 

dt = d( 1 + £t ) (29) 

and 

Alt = M(1 + T] t ) (30) 

Then rewrite contemporaneous utility as 

u t = lnu t where v t = {c)7 p + ac t ^ p }~ (31) 

OO 

so that Uq = E 0 E /3*ln v t (32) 

t =o 

Third, turn this into a small-country model, by assuming that the 
Rest of the World (the other country) consumes a fixed proportion of 
the home country’s produce, x*, and does not hold its trees; its prices 
of goods, p*, and of trees, r*, and its tree yield, d *, are exogenous (and 
constant). 
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This implies the new first-order conditions are: 

(J 

= (aRXRtY (33) 

Ct 


(- 24 -dt 

£ K pt +1 


n+i) 


n 


c* 


(34) 


, [^^+ r *+i] _ „ / r* + d* \ 

* D " ^ I r* 


(35) 


where <r = -j-j— and RXR t = is the real exchange rate. (34) is 
uncovered interest parity in real terms, obtained from (23) which is in 
nominal terms in the manner of chapter 10, equation (9). 

The home country’s residents now only obtain foreign goods over and 
above their exports and the return on their foreign trees, by selling some 
foreign trees, so the balance of payments equation becomes: 


?nl 


&t 


r *P*( s t 


’t- 1 ) 


e t 


d*p*s' t _ 1 


*/ * 7 

= m t = p t x d t 


and so 


c' t + r*As' t - d*s' t _ 1 = RXR t x*d t 
Market-clearing implies 

M t = m t = p t (ct + x *d t ) = p t d t 

so that 


Ct = d t { 1 - x*) 


and 


Pt = 


Mt 

dt 


as before. 

The import function is from (32) and (38): 

c' t = {aRXRtYc t = {aRXR t Yd t { 1 - x*) 


(36) 

(37) 

(38) 

(39) 

(40) 

(41) 


It follows that 


v t =Ct[l + {aRXR t r ap r L r 


(42) 
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Equating the right hand sides of (33) and (34), and using (31) and 
(41), we obtain: 


171 fl+(aRXR t+l )~''» 

l + (aRXR t )-<rp 


rH^r ) p+1 


E t ( 


RXR t 

RXRt+i 


(3 


{ r* + d* 
V r* 


(43) 


Leading this one period forwards but taking expectations at t still, 
we would find: 


rp / l+(a-RVflf^2) rTp 
rjt yi+(aRXR t+1 )-<'P 



7 -i / RX R t +1 \ 
^tyRXRt+J 


(3 


(^) 


+ Co 


(44) 


where Co is (minus) the constant of a Taylor series second order approx¬ 
imation of the left-hand side (in d t +i and d t + 2 ) around d . From (43) it 
is plain that RXR must be constant from t +1 onwards if we assume for 
simplicity that /3( r ) = 1, that is, domestic time preference is equal 

to the world rate of interest) and that Co is small enough to neglect. The 
left-hand side of (43) is unity with a constant RXR ; but if RXR were 
expected to rise (fall) then the left hand side would be greater (less) than 
unity. To find out this constant value, we can solve backwards from the 
terminal value (at T, T —> 00 ) when the transversality condition that 
Asy = 0 and dr = d is met; then we can solve from the balance of 
payments condition (36) that: 


c' T = RXR T x*d + d*s' T _ 1 


(45) 


Using the import function (40) yields us 

{aRXR T ) a ( 1 - x*)d = RXR T x*d + d*s' T _ 1 (46) 


Linearising the left hand side around the presumed equilibrium RXR 
we obtain: 


cr(l - x* )d(aRX f?)^ 1 RXR T + f 0 = RXR T x*d + d*s , T _ 1 (47) 


where /o is a constant; so that 


RXR t 


d*s' T _ 1 /o 

d{n(l — x*)(aRXR) a ~ 1 — x*} 


(48) 


In this we assume that a is large enough for the denominator to be 
positive; this is required for stability, as otherwise a rising real exchange 
rate (relative home to foreign prices) would cause the current account 
to improve. Since from period t + 1 the real exchange rate is constant it 
follows that 


RXRf _(_j — — 


d*s' T _ 1 - f 0 


d{a( 1 — x*)(aRXR)° 


x*} 


for z = 1,2,..., T 


(49) 
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Going back to (42), rewrite it as 

E t ( 1 + (uRXRt + i)~ ap )-p __ 

E t(wck T7) 

(3 d p t +1 RXRt{l + ( aRXR t )- ap }-p + Cl 

where ci analogously to co results from the Taylor Series approximation 
(for the terms in d t+ 1 ) around d. We can see in this equation that with 
the left-hand side now fixed, a positive shock to GDP lowers the real 
exchange rate for one period and also by (34) the real return on home 
trees. Net exports rise (the rise in output raises exports and imports by 
similar amounts assuming that x* and a are of similar size; the fall in 
RXR then assures that imports will fall back from this level), so that 
foreign trees are acquired. The following period the real exchange rate 
rises back to a permanent level higher than the pre-shock rate, since 
foreign stocks of trees are permanently higher. 

This example under our assumptions gives the typical boom/bust 
cycle; because the extra output cannot all be sold abroad more has to 
be sold domestically which forces people to spend more generally this 
period (this is effected by the falling real exchange rate driving down the 
implicit home real interest rate, made up of the world real rate minus 
the expected appreciation of the real exchange rate). 

The role of money in this model is restricted to determining, apart 
from the current price level, the current real price of trees. Money does 
not affect the real exchange rate, which is a limitation on this model. 
We could give it a bigger role by reintroducing a leisure reaction as in 
the previous model; or we could introduce nominal wage/price rigidity 
via overlapping wage contracts for example as discussed extensively in 
earlier chapters for the closed economy. Obstfeld and Rogoff (1996) set 
out a representative agent model with menu costs of price setting in 
their last chapter; the characteristics of this model are quite like those 
of the Dornbusch-style models of chapter 10 above but the complexity 
required to obtain them is a huge order higher. Increasingly economists 
are Ending that the ‘deep structure’ models required to model the world 
in a useful way are far too complex to handle in practice; this then 
impels them to write down a set of linear approximations, typically of the 
IS/LM/Phillips-curve form, which are derived from the deep structure 
model. But of course this was exactly what the original proponents 
of these models believed they were doing originally — that is, deriving 
from some theoretical structure some useable macro approximations. We 
seem to have come full circle! It is good to know one can derive these 
IS/LM models from micro-foundations but having done so the practical 



Open Economy Representative Agent Models 


331 


policy maker will want mainly to use models like those of chapter 10 or 
versions with more elaborate nominal rigidities. 


CONCLUSIONS 

We have set out here a variety of deep structure open economy models to 
give an idea of how they are built and what the balance of payments is 
doing in a deep structure way — viz helping an open economy to smooth 
consumption and to engage in arbitrage across world markets. Such 
models are useful exercises for understanding the underlying processes 
involved in open economy macroeconomics. However in practice except 
for policy or simulation exercises involving large-scale regime changes 
it is usual for economists to use the models of the type of chapter 10, 
which have their origins in the Mundell-Fleming model of the 1960s but 
have since been ‘stretched’ to incorporate the modern developments of 
rational expectations, supply functions, and nominal rigidities in wage- 
price setting. 



14 

Testing the Rational Expectations 
Hypothesis 


In the remainder of this book, we consider the ways in which the RE 
hypothesis has been tested empirically. We begin in this chapter with 
the efficient market hypothesis, which is a joint hypothesis made up 
of rational expectations and a hypothesis about how expected returns 
are determined. In chapter 15, we discuss the evidence assembled by 
Robert Barro and others to test whether output is determined by the 
New Classical model or not. In chapter 16, we review direct tests of 
rational expectations using survey data on actual expectations; we then 
turn to the estimation of complete rational expectations models. Some 
such models are ‘deep structure’ (where the parameters are those of 
tastes and technology, presumably invariant to changes in the processes 
driving the exogenous variables including policy) but most are ‘shallow 
structure’, the parameters being those of aggregate supply and demand 
curves. 

The purpose of this chapter is to consider the relationship between 
the concepts of financial market efficiency and rational expectations. 
A particular feature of financial markets is that trading can occur, in 
principle, almost continuously, and the market price is free to move to 
eradicate any imbalance between demand and supply. Furthermore since 
the assets traded (stocks, bonds, commodities) can be resold or traded in 
future periods it follows that financial markets are, more obviously than 
others, speculative in the technical sense that expectations of future asset 
prices affect current asset prices. 

We begin this chapter by defining the concept of an efficient capital 
market and setting out models for the determination of normal or equi¬ 
librium asset returns and also the implications of rational expectations 
for the determination of asset prices. A brief review of the empirical 
evidence for efficiency in asset markets together with the problems faced 
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in interpretation of the empirical evidence is also presented. 


EFFICIENCY, PERFECTION AND ASSET PRICING 
IN CAPITAL MARKETS. 

A capital or asset market is defined to be efficient when prices (e.g. stock 
prices, bond prices or exchange rates) fully and instantaneously reflect 
all available relevant information. Fama (1970) has defined three types 
of market efficiency, according to the extent of the information reflected 
in the market: 

1. Weak-form efficient: A market is weak-form efficient if it is not 
possible for a trader to make abnormal returns by developing a 
trading rule based on the past history of prices or returns. 

2. Semi-strong-form efficient: A market where a trader cannot make 
abnormal returns using a trading rule based on publicly available 
information. Examples of publicly available information are past 
money supply data, company financial accounts, or tipsters in pe¬ 
riodicals. 

3. Strong-form efficient: Where a trader cannot make abnormal re¬ 
turns using a trading rule based on any information source, whether 
public or private. 

These three forms of efficiency represent a crude partitioning of all 
possible information systems into three broad categories, the precise 
boundaries of which are not easily defined. However, they are useful, 
as we shall see, for classifying empirical research on market efficiency. 
As their names suggest, strong-form efficiency implies semi-strong effi¬ 
ciency which in turn implies weak-form efficiency, while of course the 
reverse implications do not hold. 

It is useful to distinguish between the concept of an efficient capital 
market and that of a perfect capital market. A perfect capital mar¬ 
ket could be defined as one in which the following conditions hold (see 
Copeland and Weston, 1988): 

1. Markets are informationally efficient, that is information is costless 
and it is received simultaneously by individuals. 

2. Markets are frictionless, that is there are no transactions costs or 
taxes, assets are perfectly divisible and marketable and there are 
no constraining regulations. 
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3. There is perfect competition in product and securities markets, 
that is agents are price-takers. 

4. All individuals are rational expected-utility maximizers. 

If conditions 1 to 4 were met (and assuming no significant distortions 
elsewhere in the economy), the capital market would be allocationally 
efficient, in that prices would be set to equate the marginal rates of re¬ 
turn for all producers and savers, and of course consequently savings are 
optimally allocated. The notion of capital market efficiency is therefore 
much less restrictive. An element of imperfect competition in product 
markets would imply capital market imperfection; nevertheless, the stock 
market could determine a security price which fully reflected the present 
value of the stream of expected future monopoly profits. Consequently 
the stock market could still be efficient in the presence of imperfection. 

Asset prices, in order to give the correct signals to traders, must 
fully and instantaneously reflect all available information. However, as 
pointed out by Grossman and Stiglitz (1976, 1980), it cannot be the case 
that market prices do fully and instantaneously reflect all available infor¬ 
mation. If this were so, agents would have no incentive for collecting and 
processing information, since it would already be reflected in the price, 
which each individual is assumed to be able to observe costlessly. It is 
the possibility of obtaining abnormal profits in the course of arbitraging 
which provides the incentive to collect and process new information. In 
the Grossman and Stiglitz model, individuals choose to become informed 
or remain uninformed, and in equilibrium each individual is indifferent 
between remaining uninformed on the one hand, and collecting infor¬ 
mation (or buying the expertise of brokers), so becoming informed, on 
the other. This is because after deducting information costs each action 
offers the same expected utility. 1 


1 Hellwig (1982) challenges the Grossman and Stiglitz proposition that the infor- 
mativeness of market prices in equilibrium is bounded away from full informational 
efficiency. Hellwig points out that this proposition rests on the assumption that 
agents learn from current prices at which transactions have actually been completed. 
This is a model in which investors learn from past equilibrium prices but not from 
the auctioneer’s current price offer. Hellwig is able to show that if the time span 
between successive market transactions is short, the market can approximate full in¬ 
formational efficiency arbitrarily closely and yet the return to being informed remains 
bounded away from zero. This results from the fact that informed agents can uti¬ 
lize this information before uninformed agents have an opportunity to infer it from 
current market prices. 

Hellwig also pursues the implications that arise if one relaxes the assumption that 
agents cannot assure themselves of being informed in a given period, but rather agents 
choose the frequency on average at which they obtain information. It appears that 
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Nevertheless, a reasonable interpretation of empirical tests of the 
efficient model hypothesis is, given that the data are collected at discrete 
intervals, that the process of arbitrage has occurred within the period. 
Consequently the implications of different available information can then 
be analysed without modelling the process of arbitrage itself (Begg, 1982) 
and this is the usual assumption in empirical work. 


RATIONAL EXPECTATIONS AND MARKET EFFI¬ 
CIENCY 

The semi-strong-form efficient markets model, i.e. that based on pub¬ 
licly available information, is an application of the concept of rational 
expectations, although this was not stressed in the early literature on 
efficiency, which goes back much further than the rational expectations 
literature. If expectations are non-rational, then publicly available in¬ 
formation will not be reflected in asset prices and systematic abnormal 
profit opportunities will be available. This can be seen simply enough by 
noting that market agents have to know the model governing prices (or 
act as if they know it) in order to eliminate abnormal expected returns; 
if the model governing expected prices is different from that governing 
actual prices, there will be systematic abnormal returns available in the 
market. 

Strong-form efficiency also implies that agents have rational expec¬ 
tations, since they must know how to use all sorts of private information 
as well as public; where strong-form efficiency differs from semi-strong is 
about the effects of private information on the market (fully discounted 
in strong-form, not at all in semi-strong). 

Not quite the same is true of weak-form efficiency. In this case, since 
they make efficient use only of the past history of prices, they must know 
the time series model governing prices; strictly speaking this does not 
imply knowledge of the underlying structural model, since there will be 
generally insufficient identifying restrictions. Nevertheless, in practice, 
with limited samples and structural change, recovery of the time-series 
parameters by market agents from the data can effectively be ruled out. 
It is therefore natural, if not necessary, to assume in this case too that 
agents have rational expectations and so know the underlying model, 
from which they are then able to derive the time-series parameters. 

While, therefore, market efficiency can be regarded as implying ratio- 


relaxing such assumptions leaves his main result above unimpaired — see also the 
survey by Jordan and Radner (1982). 
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nal expectations, rational expectations does not imply market efficiency. 
Market efficiency is a joint hypothesis about expectations and market 
behaviour (specifically, the model of equilibrium expected returns). The 
main hypothesis about behaviour is the capital asset pricing model (see 
below); and in empirical tests more detailed assumptions must also be 
made about how equilibrium returns will move. Further hypotheses con¬ 
cern the behaviour of the agents with access to different sets of ‘available’ 
information. Under weak-form efficiency, active market participants are 
assumed to make effective use only of the past history of prices in their 
market; one theoretical basis for this has been in the costs of obtaining 
and processing wider information (Feige and Pierce, 1976). Under semi- 
strong-form efficiency, the assumption made is the usual one in rational 
expectations macro models that active agents use all publicly available 
information, presumed to be useable at zero or low cost. Finally, in the 
strong-form case it is assumed that those agents with access to private 
information deploy or indirectly influence funds to eliminate expected 
returns from this source of information; there are, however, problems 
with this, since private information cannot be sold at a fair price (once 
divulged it is valueless, but before divulgence it is impossible for the 
buyer to assess) and those with access have, by definition, in general 
only limited funds. 

Because all definitions of market efficiency invoke the concept of ab¬ 
normal returns, we are required for empirical work to have a theory of 
the equilibrium expected rate of return for assets. Tests of market effi¬ 
ciency are conducted after allowance for the equilibrium rate of return. 
If the riskiness of an asset does not change over time (or conversely if its 
risk changes randomly over time) then, for example, weak-form efficiency 
implies that there should not be an extrapolative pattern in the time se¬ 
ries of returns. If there were a recurring pattern of any type, traders 
who recognized the pattern would use it to make abnormal profits. The 
very effort to use such patterns would, under the efficiency hypothesis, 
lead to their elimination. 


THE CAPM MODEL OF EQUILIBRIUM EXPECTED 
RETURNS 

The equilibrium expected return on assets is a central topic in modern 
portfolio theory and the interested reader is directed to, for example, 
Copeland and Weston (1988) for a full discussion (Allen, 1985, provides 
a helpful introduction). The following brief account of one such and 
widely used theory, the capital asset pricing model (CAPM), must suffice 
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here. 

Only if traders are risk-neutral 2 will they be indifferent to the vari¬ 
ability of the returns (i.e. the risk) on their portfolio. Risk-averse indi¬ 
viduals will be concerned about aggregate portfolio risk and will require 
a risk premium on each asset (or class of assets). By combining assets 
in the portfolio it is possible to diversify away some of the risk (the ‘un¬ 
systematic’ risk) associated with an asset. However, to the extent that 
the returns on an asset move with the market, there will be a compo¬ 
nent of risk (systematic risk) that cannot be diversified away. Assuming 
optimal portfolio diversification, the risk premium reflects the asset’s 
systematic risk and hence its contribution to the overall variability of re¬ 
turns on the portfolio. This premium will be included in the equilibrium 
expected rate of return on this asset, in addition to the general rate of 
return on the portfolio. 

The algebra of the CAPM theory is straightforward. All investors are 
assumed, as in standard portfolio analysis, to maximize their expected 
utility subject to their portfolio wealth constraint: this yields for each 
investor an optimally diversified holding of each asset in terms of the 
expected returns on all assets. It is convenient to aggregate this equation 
across investors and refer to the result as the asset demand of a ‘typical’ 
investor (but this does not imply that all investors are the same; it is 
merely an expression indicating averaging). The CAPM’s contribution 
is then to note that at any time there will be a certain stock of each 
asset outstanding in the market which must be held in equilibrium. To 
be held, its expected return (and so its price) must satisfy the asset 
demand equation of the typical investor: in other words, this equation 
is turned round and solved for the necessary expected return in terms 
of the outstanding asset quantity. As the quantity of each asset rises, 
however risky it may be, it contributes only a little to the total risk on 
the whole market portfolio because it forms only a small part of this 
diversified whole; hence the rise in necessary expected return (fall in 
price) is negligible and we can talk of ‘the’ required expected return on 
an asset, independently of its quantity outstanding. Clearly this must 
not be pushed too far: some assets (e.g. dollar liabilities in currency 


2 Consider an individual faced with a possible gamble; he may choose either to 
receive 100 for sure, or to toss a coin and receive 50 if heads occur and 150 if tails 
occur. The expected outcome of this latter choice is 100 = 0.5(50) + 0.5(150). The 
question is: will the individual prefer the actuarial value of the gamble (this it its 
expected outcome) with certainty or will he prefer the gamble itself? If he prefers the 
gamble he is a risk lover; if he is indifferent he is risk neutral; and if he prefers the 
sure outcome he is risk averse. It is also possible to compute the maximum amount 
of wealth an individual would be willing to give up in order to avoid the gamble. This 
is the notion of a risk premium (see Pratt, 1964; Arrow, 1971; Markowitz, 1959). 
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portfolios) are outstanding in very large quantities and their risk cannot 
be fully diversified away. 

Assume three assets, one of which yields a safe return . The typ¬ 
ical investor maximizes the expected utility of end-of-period wealth, 
E t U(W t+ i), with respect to uq and w 2 subject to the budget constraint 

W t+ 1 = Wl R it + iv 2 R- 2 t + (1 - wi - w 2 )Rt (1) 

where Wi is the share in the investor’s portfolio of asset i, Ru is the 
actual return during the period, W t +iis his end-period wealth expressed 
as an index (beginning period wealth = 1), U is his utility function 
(U' > 0, U" < 0), and E t is his expectation formed at the beginning of 
the period when he takes his investment decision. 

Take a Taylor series expansion of U(W t + 1 ) around E t (W t + 1 ): 

U(W t+1 ) = U(E t W t+1 ) + U'.(W t +1 - EtW t+ 1) + 

0.5 U".E t (W t+1 -E t W t+1 ) 2 + .... (2) 

Ignoring terms of higher order than two, (see e.g. Kraus and Litzen- 
berger (1976), Hwang and Satchell (1997) or Harvey and Siddique (2000) 
for some implications of higher moments) or alternatively assuming that 
the agent’s utility function only depends on the first two moments, the 
expectation of this is: 

E t U(W t+1 ) = U(E t W t+1 ) + U'.(E t W t+1 - EtWt+x) + 

0.5 U".E t (W t+1 ~E t W t+1 j’ (3) 

which is equal to 

U(w\E t R u + w 2 EtR 2 t + {1 - w\ - w 2 }R t ) + 

0.5 U"(w\a\ + w^a\ + 2w\w 2 ai 2 ) (4) 

where = E t ([R it — E t R it ][Rj t — E t Rj t ]), the covariance between the 
returns of assets i and j, and of = the variance of V s return. 

The first order conditions with respect to uq and w 2 respectively are: 

U'[E t R lt - R t ] = U"[ Wl ai + cj V2 w 2 \ (5) 

and 

U'[E t R 2t - Rt] = U"[w 2 u\ + a 12Wl ] (6) 

-U" 

= P 


U' 


( 7 ) 
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p is defined as a measure of absolute risk aversion. 

We can express (5) and (6) in the CAPM form with required expected 
asset returns as the dependent variable (the standard portfolio analysis 
has the asset shares as the dependent variables): w\ and w 2 are given 
by the outstanding stocks of assets 1, 2 and the safe asset in the overall 
market. 

Rearranging (5) the expected return on asset 1 for example is : 

E t Rit = R t +p{wial + w 2 a 12 ) = R t +pw{x 1 [al - <t 12 ] + a 12 ) (8) 

and on asset 2 

E t R- 2 t = Rt +p{w 2 a\ +w 1 a V2 ) = R t + pw{x 2 [tj\ - <ti 2 ] +<ti 2 ) (9) 


where X\ is the share of asset 1 in the risky part of the market portfolio, 
x-\ = —, x 2 = —, and w is the share of the risky part in the total 
market (w = wq + w 2 ). Suppose we regard asset 1 as being a single 
asset and asset 2 as being a portfolio of all assets in the risky market, 
that is essentially ‘the’ risky market portfolio. Then (8) reveals that 
the expected return on a single asset consists of the safe return plus a 
risk premium reflecting risk aversion (p ), the overall share of risky assets 
in the whole market ( w ) and the covariance between asset 1 and the 
risky market (cti 2 ), ‘systematic risk’: there is also a small component 
in the risk premium for the extent to which risk on asset 1 exceeds this 
covariance (‘diversifiable risk’), but this has a negligible effect because 
it is multiplied by x\, the small share of asset 1 in the risky market. 
So provided asset 1 is small in relation to the market, its risk can be 
virtually totally diversified away and barely affects the expected return. 

Suppose we now let asset 2 embrace the whole risky market. Then 
x 2 = 1 and we have from (9): 

E t R mt = Rt+ pwu 2 m (10) 

where m is the whole risky portfolio so that in this case g\ = cr^. From 
this it follows that: 


pw = 


(EtRmt — Rt) 


a 


2 

m 


(ii) 


This means we can rewrite asset l’s required return, substituting (10) 
into (8), as 


EtRit — Rt + 


{xi[aj - aim] + ai m } 


(EtRmt — Rt) 


( 12 ) 


or more generally for any asset 


EtRit — Rt + Pi(Et.R m t — Rt) 


(13) 
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where we assume the weight x. is small enough to be ignored and so 


Pi 


(14) 


E t R m t — Rt is the excess return required on a unit of the average port¬ 
folio: it is therefore ‘the cost of average risk’. Individual assets com¬ 
mand higher or lower excess returns according to their ‘beta’, /?,, which 
measures their systematic risk, the covariance of the ith asset with the 
market portfolio, divided by the variance of the market portfolio — as 
illustrated in figure 14.1. 



Figure 14.1: The Pricing of a Security 


OTHER MODELS OF EQUILIBRIUM EXPECTED 
RETURNS 

A related idea to the CAPM assumes that aggregate market risk is dom¬ 
inated by several sources of risk: principal component analysis can be 
used to separate these out. Thus there is not one but several market 
‘risks’ or variables which are uncorrelated (for example, one variable 
might be a measure of the world business cycle, another might be the 
North-South terms of trade): the expected return associated with tak¬ 
ing each risk is priced exactly as above — now think of ‘asset 1’ and 
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‘asset 2’ as being respectively ‘exposure’ to ‘risk 1’ and ‘risk 2’: that is, 
you buy £1 worth of assets combined in such a way that their return is 
correlated perfectly with variable 1, and similarly for variable 2. These 
fundamental risks being priced, each asset is then priced, by arbitrage, 
according to its amount of each risk. This is the arbitrage pricing theory 
(APT), due to Ross (1976). 


CONSUMPTION CAPM 


In chapters 11 and 12 we set out Lucas’ theory of asset pricing. The 
first-order condition for the optimal consumption and portfolio decision 
is given by equation (9) in chapter 11 as 

u'{c t ) = E t (3{ 1 + R it )u'(c t+ 1 ) (15) 


where we have generalised the model to allow for the real gross return, 
1 + Rit, on the ith asset’ 3 

Recalling that the covariance of X and Y denoted by covXY is by 
definition covXY = E{X - EX).(Y - EY) = EXY - EX.EY (we 
also note in passing that cov{ 1 + X, Y) = covXY), we can write the 
unconditional form of (15) as 


E( 1 + Ru) — 


(1 — cov(l + R it .K t+ i)) 
EK t+1 


(16) 


3 If we write the budget constraint as 
ct + ptst+i + qth+l = (Pt + dt)s t + b t 

where bt represents holdings of the riskless asset, st represents holdings of the risky 
asset, pt is the price of the risky asset, qt is the normalised price of the riskless asset. 
The optimization yields 


o 

II 

< 

1 

o 

(1) 

—A tqt + f3Et\t+i = 0 

(2) 

A tpt + 0Et(pt +1 + dt+i)At-|_i = 0 

(3) 


where A t is the Lagrange multiplier. 
These conditions, 1-3, yield 


1 

1 


0E t 

pE t 


u'(c t+ l) 

qtu'(ct) 

(Pt +1 + d t+ i)u'(c t+ i) 
ptu'(ct) 


noting that q t 1 = (1 + it) where R is the risk-free real rate, p, s can be generalized 
to any asset as in the text. 
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where K t +i = is called the stochastic discount factor or pricing 

kernel. 

For a risk free real interest rate R{ the covariance term is zero so 
that 


E(l + R{) 


1 

EK t+ i 


For the portfolio of assets we have that 

7TV-I I D (1 — COV(Rmt, Rt+l)) 

B(1 + R mt ) = - — - 

We can employ (16), (17) and (18) to obtain 

E{Rit) — R{ = —(1 + R{)cov(Rit , K t+1 ) 


(17) 


(18) 


(19) 


and 


E(Rit) - R{ = 


[covRjt, K t+ i](ER mt Rt ) 
cov(Rmt, Kt- |-l) 


( 20 ) 


Equation (20) illustrates that an asset’s expected real return in excess of 
the risk free rate is higher the more negative the covariance between the 
asset and the ratio of marginal utilities. This is because when consump¬ 
tion is smaller, marginal utility is higher. An asset with a more negative 
covariance thus implies that the return is lower when consumption is 
low. This is the precisely the state however when wealth is more valued; 
consequently the agent requires a higher risk premium to hold the as¬ 
set. This model of the relationship between expected returns is known 
as the consumption CAPM. Equation (20) shows the relationship to the 
ordinary CAPM where covariances with the pricing kernel replace those 
with the market portfolio. 

To make the model empirically operational we have to specify the 
error structure of the model and the form of the utility function. 

Consider the implications of the model if 

c i-« _ i 

<°t) = ( 21 ) 


where 6 is a constant and is called the coefficient of relative risk aversion. 4 
This function has the convenient property that it nests the linear case, 
6 = 0, and the logarithmic form, 6 = 1. (The latter case is found by 
employing L’Hopital’s rule when 6 = 1.) 

For this function 

u'(c t+ 1) _ , Cf + 1 s 

u'(ct ) ^ Ct ' 


( 22 ) 
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In addition we assume that consumption and asset returns are jointly 
conditionally lognormally distributed. This allows us to make use of the 
fact that for a conditionally lognormally distributed variable, Y, 

In E t (Y) = E t [In Y\ + 0.5uar t [In Y] (23) 

Taking logarithms of (15) and employing (23) we obtain 


E t ln(l + R it ) = - ln/3 + 6E t (InCt +1 - lnc t ) 

— 0.5{var ln(l + Rit) + <5 2 var(ln ^ii)) 

c t 

— 2(5cov(ln(l + R it ) ln(^ii))} (24) 

Ct 

(Note let Y t = (1+fl ; i , ) "'j Cf+l) so from (15), ln(l) = \n(5+ \n(E t Y t ). Then 
use (22) and (23) to obtain 

0 = ln/3 + £? t (ln(l + R it ) + 0.5{var(l + R it ) + 

Ct 

<5 2 var(^ii) — 2<5cov(ln(l + Ru) ln(^ii))} 
c t c t 

Rearranging this expression gives (25)) 

If the real rate of interest on an asset, R{, is risk free (24) simplifies 
to 

ln(l + R{) = — ln/3 + SE t (\nc t+ i — In cA — 0.5(<5 2 var(ln ^fii) (25) 

Ct 

When there is a nominal asset yielding a certain nominal rate, N t , 
we can employ the definition of the real rate of interest for this asset, 
say, 1 + R N t, as 


1 + i?jvt 


(1 + N t )P t 

Pt +1 


(26) 


where P is the price level. 

Substitution of (26) in (15) and employing the same method as above 
gives us the required nominal return on the safe nominal asset, assuming 
conditional normality, as: 


ln(l + N t ) = - ln/3 + ^(lnPt+i - lnP t ) + 6E t (\nc t+ i - lnc t ) 

— 0.5(<5 2 var(ln ^ii) + var(7r) + 28cov(\n 7r)) (27) 

Ct Ct 


where n = ln(P t+ i) — ln(P t ) 
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We observe from (25) that the riskless real rate has a linear relation¬ 
ship with the expected change in real consumption. If the riskless real 
rate is assumed constant we can rearrange (25) with expected changes in 
consumption as the left hand side variable. This is the relationship esti¬ 
mated by Hall (1978): under rational expectations changes in consump¬ 
tion should be orthogonal to information dated prior to the information 
set conditioning expectations. 

From (27) we observe that the safe nominal asset responds with a 
unit coefficient to expected inflation and is also determined by terms 
which reflect inflation risk premia. 

Subtracting (25) from (24) we obtain 

E t ln(l + R it ) - ln(l + R{) + 0.5uar(l + R it ) 

= 6cov ^ln(l + i? it ),ln(^p-)^ (28) 

or that 

In E t {1 + R f = Scov fln(l +J? it ) I ln(—)) (29) 

(1 + R{) V d ) 

The implication of this model is that risk premia are determined by 
the covariation between the asset and changes in consumption times the 
coefficient of relative risk aversion. 

Because the covariance term is relatively small in actual data, empir¬ 
ical tests of this model have not been able to explain the excess return 
on stocks over bonds, on average some 6% in the US over 100 years and 
large in other countries as well (see Campbell, 1996), without assuming 
a possibly implausibly large coefficient of relative risk aversion (well in 
excess of 10). This has become known as the equity premium puzzle after 
the pioneering contribution of Mehra and Prescott (1985). Recent con¬ 
tributions have suggested a variety of mechanisms that might resolve the 
puzzle. These include more general specifications of the utility function 
(e.g. Epstein and Zin, 1989, 1991), the introduction of habit forma¬ 
tion (e.g. Constantinides, 1990; Campbell and Cochrane, 1999, 2000), 
survivor bias (e.g. Brown, Goetzmann and Ross, 1995), peso problems 
(see below; Rietz, 1988), heterogeneous agents (e.g. Constantinides and 
Duffie, 1996). Many of these ideas are superbly set out in Campbell, Lo 
and MacKinlay (1997). 
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DETERMINATION OF ASSET PRICE BEHAVIOUR 
UNDER RATIONAL EXPECTATIONS 


We next consider the implications of the assumption of rational expec¬ 
tations for the behaviour of asset prices. Consider the return to holding 
an asset over the period t to t + 1, say a share. We have 


Rt+i — 


Pt +1 + A+i 


- 1 


(30) 


where D t +i is any dividend or payment in the period and Pt, Pt+i is 
the price of the asset at time t and t + 1. 

We assume initially for simplicity that investors are risk neutral so 
that via arbitrage expected returns are equal to those on a riskless asset 
with a real rate of interest, R, assumed constant, so that E t Rt ,+i = R. 

Since Pt is part of the current information set, the expectation of 
(30) when rearranged is given by 


Pt = 


a 


~=r E t p t+i + 


-E f D t 


(31) 


Assuming rational expectations and solving this model forwards N 
periods we obtain 


p — p J^t+i , PtPt.+N 

* ^(l + RY (1 + R) N 


(32) 


In the absence of speculative bubbles (see below) we assume that the 
last term goes to zero as we let N go to infinity. In this case the current 
asset price is equal to the expected value of the stream of dividends into 
the indefinite future. This term is the fundamental of the process which 
we can call F t . 

Under these assumptions we have that: 


Pt = E t 


E 


Dt+i 

(1 + R) i 


We can write (33) in the equivalent form 


(33) 


E 


Dt+i 

(1 + Rf 


Dt_ 
1 -f- R 


Dt E t D f +i E f Dt+t 

1 + R + (1 + R) 2 ~ (1 + R) 2 


EjDt+2 

(1 + R) 3 


EtDt +2 
(1 + i?) 3 


Pt = Et 
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Rearranging (34) we have that 


Pt = E t 


E 


AD t+ i 

(1 + !)* 


Dt AA+i E t D t+2 
1 + 1 + (1 + l ) 2 + (1 + l ) 3 + ‘' ’ 


so that using (33) we can rewrite (35) as 


Pt = E t 


E 


ADt+i 

( 1 + 1 )* 


D t P t 

-= H-= 

l+R 1+R 


(35) 


(36) 


Multiplying through by 1 + 1 and rearranging we obtain the alter¬ 
native form 


Pt 


A ^A+i 

11 ‘ \j-[ (l+l) 4 - 1 


(37) 


Equation (37) shows that the current price of the stock is equal to 
the dividend divided by 1 plus a term in the discounted stream of ex¬ 
pected future changes in dividends. In this form the model is amenable 
to empirical testing in the form of cointegration analysis (see Time-Series 
Annex). If dividends are a non-stationary process but changes in divi¬ 
dends are stationary then the stock price is cointegrated with dividends 
with coefficient =. 

An insightful special case of the above model arises when dividends 
are expected to grow at a constant rate g. 

For this case 


EtDt + i — (1 + g)E t D t j r i-i — (1 + g) 1 A 
Substituting (38) in (33) we obtain 

, (i + g)A (i + g)_ 2 A (i + gfA (H#A 

' (l + l) (l + l ) 2 (l + l ) 3 (l + l ) 4 

We can rewrite (39) as 

p ( 1 +g L A [i i ( 1 + g } i [l+g J i ( 1 + i )3 i 
(i+i) L (i+i) (i+i) 2 (i+i) 3 


(38) 


(39) 


(40) 


Recalling that = 1 + x + x 2 + x 3 + .for | x | < 1 and assuming 

that g < R (as it must be since the stock price is not infinite) we can 
rewrite (40) as 


(1 + ff)_A f 1 \ = (1 +ff)A 

< 1 + *> ll-^j 


(41) 
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This form is the Gordon growth model. It demonstrates the impor¬ 
tant point that if R is close to g then small permanent changes in R can 
have a major impact on the stock price. 

The assumption made in the derivation of the asset price (33) is that 
expected stock returns are equal to a constant risk-free real rate of in¬ 
terest (in the context of CAPM say a constant real rate plus a constant 
risk premium). Relaxing this assumption in the above framework results 
in the loss of analytic tractability since expectations would be of a non¬ 
linear form. It is interesting to note an approximation for the case of a 
variable return that preserves tractability which was initially employed 
by Campbell and Shiller (1988 a, b). 

Taking logarithms of (30) we have that 


ln(l + R t+1 ) = In(P t+1 + A+i) - ln(Pt) = ln(P t+1 ) - ln(P t ) 

Dt +1 


+ hi 1 + 


t+i 


Noting that -pr^ = exp[lnD t+ i — lnP t+ i] we can rewrite (42) 
r t+ i = p t+ 1 -p t + ln(l + exp(d t+ i - p t +i)) 


as 


(42) 


(43) 


where lower case letters now represent logarithms so that r t = ln(l + P t ) 
Letting Zt = d t + 1 — pt+i and taking a first-order Taylor expansion of 
the last term in (43) around the mean value of z we obtain 

GXp {^z') 

r t +1 = Pt+i ~Pt + ln(l + exp(J)) + 1 | _ [rf t +i - Pt+i ~ z] (44) 


Rearranging (44) we obtain 

Pt+i exp(z)d t+ i 


n +1 = 


1 + exp(z) 1 + exp(z) 


1 + exp(z) 


Pt + ln(l + exp(z)) 


2 exp ( 2 ) 

1 + exp (z) 
(45) 


Letting A = 1+e ^ p (_) and noting that 4^-A = exp ( 2 ) and 1 — A = 
(l+expfz)) we can rewrite (45) as 

' 1 - A N 


r t +1 = Xpt+1 + (1 - A)d t+ 1 - Pt ~ ln(A) - (1 - A) In 


(46) 


or 


P+i — Apt+1 + (1 — A)d*+i — pt + 6 
where 6 = — ln(A) — (1 — A) In 


(47) 
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For a constant dividend-price ratio 

^ = i T d 

1 ' p 

The average value of the dividend price ratio observed in economies 
shows that the value of A will be close to unity, around 0.96 in the US 
for example, so that in (47) the weight on the log price is close to one 
and that on log dividends closer to zero. Clearly the approximation 
will be more accurate the smaller the variation in the log dividend price 
ratio. Analysis by Campbell and Shiller (1988a) is suggestive that the 
approximation does not produce gross violations of reality, particularly 
at the monthly level of analysis. Essentially the approximation provides 
analytical tractability at the cost of some error in the statement of aver¬ 
age returns. From an empirical perspective dividends appear to be more 
parsimoniously explained as a loglinear rather than a linear time-series 
process and this is an advantage of the approximation method. 

We can take expectations of (47) and solve forward to obtain 


Pt 


6 

1 - A 


+ E t 


A* [(1 - A) dt+i+i - r t +i+ 1 ] 

„i=0 


(49) 


where we assume once again in the absence of speculative bubbles that 
the term E t A°°p t+00 is zero. 

We observe from (49) that the current log stock price is higher the 
higher expected future dividends and the lower expected future returns 
(i.e. the stock discount factors). 

We can transpose (49) following the type of procedure outlined above 
(34) to obtain the equivalent form 


Pt 


6 

1 — A 


+ dt + Et 


A* (Adt+i+i — rt+i+ 1) 

_i =0 


(50) 


(Hint: in (49) add inside square brackets the term d t — d t + Xd t +i — 

Xd t +i + A 2 d i+ 2 — X 2 d t +2 H-b X n d t + n — A n d t+n , rearrange in terms of 

Adt+i+i- The terms in d t + Xd t+ \ + X 2 d t+ 2 + + A"d i+ „then can be 

substituted out.) 

Equation (47) is useful for illustrating another point.Taking expecta¬ 
tions at time t of (47) and subtracting the resultant from (47) we obtain 


U+i — E t r t+ 1 — X(pt+i — E t pt+ i) + (1 — A)[dt+i — E t dt+ 1 ] (51) 


If we lead (50) one period, take expectations of it at time t and 
subtract the resultant from it, we obtain an expression for p t +1 — E t pt +1 
which we substitute in (51) to obtain 
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r t +1 - E t r t+ 1 = 


A < 


dt+i — Etdt+i + Et +1 


X) A ! (Ad t+ 2+,: — r*+2+i) 

,i=0 


A* (Ad t+ 2+i — U+2+i) 

i=0 


+ (1 — A) (rft+i — Efdt+i) 


(52) 


Recognising that d t+ i-^ t di+i = E t+1 d t+1 -E t+1 d t -(E t d t+1 -E t d t ) 
under the information assumptions of the model, i.e. dt is observable at 
time t, d t+ i at t + 1, so E t+1 d t+ i = d t + 1 , E t d t = d t , we can rewrite (52) 
in the form presented by Campbell (1991) namely 


oo oo 

r t +1 — E t r t+ 1 = -Bt+i A*Ad i+ i + j — E t A ! Ad i+ i + i 
2=0 2=0 

CO CO 

- [£ t+1 2 AVt+1+j - Et ]T AVt+i+i] 

2=1 2=1 

CO 

= {Et +i — E t ) A'(A(i i+ i + j — A?’ t+ 2+i) (53) 

i =0 

Equation (53) is informative. We observe that unexpected stock re¬ 
turns are a function of revisions of expected future changes in dividends 
and revisions of stock discount factors. Further, ceteris paribus an up¬ 
ward revision in expectations of future returns leads to a fall in the price 
today since for a given dividend stream this can only be generated from 
a lower price today. The equation also demonstrates how revisions of ex¬ 
pectations into the indefinite future impact on the current innovation in 
returns. Under rational expectations these revisions are news. However 
this creates a problem in empirical work which endeavours to estimate 
the impact of empirical measures of news on unanticipated asset returns. 
Clearly the impact of current news can have an ambiguous impact on 
unanticipated returns depending on the implications it has for future 
revisions to expectations. For example the news that the current money 
stock is higher than anticipated could be interpreted as a signal for future 
tightening of money stock or a move to a more relaxed regime. Similarly 
output figures higher than anticipated could signal the end of a slump 
or the beginning of an inflationary episode. This implies that without 
knowledge of current and future policy regimes empirical estimates of 
the impact of current news will be ‘non-structural’ so that the estimated 
coefficients may be difficult to interpret. 

We can illustrate two other important implications for the potential 
behaviour of asset prices and returns using a modified example borrowed 
from Campbell, Lo and Mackinlay (1997). 
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Assume that the log of dividends is generated by the process 

dt +1 = pdt + Ut+ 1 (54) 

where u t +1 is a random variable and| p\< 1. So we assume that dividends 
are a stationary process in this example. 

Further assume that the stock discount factor is generated by 

E t r t +i = r + y t (55) 


where r is a constant and y t is generated by the process 


y t = ay t -i + v t 


(56) 


where | a | < 1 and v t is a random variable. 

yt is an observable variable and might represent changing risk. Given 
these assumptions we can substitute in (50) to obtain the solution for 
the price of the asset. The log price of the asset (50) is given by 



The first term in the braces, substituting for future dividends, is given 

by 


(1 — \)[E t dt+i + XEfdt+2 + X~E t dt+3 + ■..] = 

(1 — A )[pdt + A p 2 dt + A ~f?dt. + ...] (58) 

and the second term by 


- {E t r t+ 1 + A E t r t+2 + A 2 E t r t+3 + ...} = 

~{r + yt + A(r + ay t ) + A 2 (r + a 2 y t ) + ... } (59) 


so that substituting in (57) we obtain 


Pt 


6 , , (1 - A)p _ _jr _ y_t_ 

1-A ' 1-A p 1-A 1 — Aa 


(60) 


(recalling again that = l + x + x 2 +x 3 + ... for | x \< 1) 

We can substitute (60) into (47) to obtain the solution for the one 
period stock return as 


n +1 = A 


5 


1 — A T 1-A p 1 — A 
4 + 


Vt+i 

1 — Aa 
r 


1-A 


1 — A p 1 — A 1 — Aa 


+ (1 — X)dt+i 

6 (61) 


Vt 
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Using (54) and (56) and simplifying this gives 


rt+i = r + u t +i +yt ~ (62) 

The solution for log prices illustrates that when the variability of yt 
is small, so that the variability of expected returns is small, this does not 
imply that the variability of prices need be small. This is because the 
denominator of the term in y t can be small. More formally, assuming 
the covariance between y t and dt is zero, we have from (60) that the 
variance of prices is given by 


<77 


_ , (l-A)V 

(1-Aa) 2 (1-A p) 2 


a 2 d where a 2 = 


1 


a* 


(63) 


where a 2 d = prjUs the variance of dividends. 

Clearly a 2 can be small but a 2 large. 

We can also solve for the reduced form ARMA time-series repre¬ 
sentation (see Time-Series Annex) of log returns and log prices in our 
example. 

Using the lag operator, and substituting for y t we obtain for returns 


rt +1 = r + u t +i + 


Vt 

(1 - aL) 


Xv t +i 
(1 — Xa) 


(64) 


Multiplying out the lag operator we obtain 

, x . | v t Xv t +i , . 

n+1 = or, + r(l - o) + «,+, ~ an, + (65) 


The summation of the two error terms on the right hand side of (65) 
can be rewritten as a moving average error process of order one so that 
observed returns follow an ARMA (1, 1) process. Depending upon the 
covariance between news in dividends and news in returns the process 
can exhibit positive or negative autocorrelation. 

Employing a similar process for changes in asset prices we obtain the 
ARMA representation for the level of the log asset price as (where L is 
the lag operator) 


Pt { 1 - pL){ 1 - aL) 

t (1 - \)pu t (l - aL) v t {l-pL) ^ 
(1-A) + 1-A p H (1 - Act) 1 j 

so that the level of the log of the asset price is described by an ARMA(2, 
1) in this example. 
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Although both returns and asset prices are forecastable in this ex¬ 
ample there can be no presumption that the market is inefficient. The 
crucial element in the definition is that abnormal returns should not 
be forecastable in an efficient market. We will observe another exam¬ 
ple of this next where the exchange rate has a predictable path in the 
Dornbusch overshooting model even though the market is efficient by 
construction. 


OPEN ECONOMY MODELS WITH EFFICIENT FI¬ 
NANCIAL MARKETS 

In Chapter 10 we set out the general behaviour of macro models of the 
open economy with efficient markets and New Classical price behaviour, 
under fixed and floating exchange rates. Here we focus in more detail 
on the behaviour of nominal exchange rates under floating, under vary¬ 
ing assumptions about price behaviour. This will illustrate the role of 
financial efficiency per se in open macro models. 

Our model is based on that outlined by Dornbusch (1976) in his 
seminal paper. For simplicity it is assumed that there is perfect capital 
mobility between countries (i.e. transactions costs are negligible and 
international assets are perfect substitutes). Consider initially a risk- 
neutral agent who is faced with the choice between holding a domestic 
or foreign bond for the duration of one period (say 90 days). The nominal 
rates of interest in the foreign country and in the domestic country are 
given by R t and Rf respectively. Since the bonds are perfect substitutes, 
asset market equilibrium requires that the expected rates of return on the 
two bonds be equal. This expected rate of return has two components. 
The first component is the interest rate on the bond, which we can 
assume to be known at t; the second component is the expected capital 
gain or loss from exchange rate changes during the 90-clay period. 

It follows that the speculative condition for equilibrium, known as 
uncovered interest arbitrage is: 

Rt=R? + ( E t S t+1 - S t ) (67) 

where Rt, Rf are the domestic and foreign nominal interest rate, S t 
is the logarithm of the current exchange rate (here domestic units per 
foreign unit) and E t St +iis the expectation of the rate in period one (90 
days in our example). A rise in St in our notation here represents a 
depreciation of the home currency. Equation (67) therefore implies that 
the interest rate differential in favour of domestic bonds must be equal 
to the expected depreciation of the exchange rate. 
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For example, if the domestic currency pays 12 per cent interest (per 
90 days) and the foreign currency pays 4 per cent interest, a domes¬ 
tic investor buying foreign currency at the beginning of the period and 
converting back at the end of the period will, assuming the domestic 
currency depreciates by 8 per cent, expect to finish up with sufficient 
domestic currency to make him indifferent between holding domestic or 
foreign bonds. 

There is also a forward market for foreign exchange in many exchange 
rates (i.e. traders can at time t contract to trade foreign currency at time 
t + 1). In the forward market, large transactors are required to put up 
only very small amounts of money as ‘margin requirements’, so there is 
no need to discount. Consequently, using a similar argument to the one 
above, arbitrage implies the covered interest arbitrage condition that 

R t = R? + ( F t - S t ) (68) 

where F t is the logarithm of the forward exchange rate at time t for 
period t + 1. We note that the covered condition is riskless and holds 
via arbitrage regardless of the manner in which expectations are formed. 
Uncovered arbitrage is a speculative condition, hence the explicit as¬ 
sumption of risk-neutral investors. Equating the covered and uncovered 
condition we obtain that F t = E t S t+ 1 so that the forward rate is a direct 
measure of the market’s expectation of the future exchange rate. The 
properties of the forward rate as a predictor of future spot rates has been 
a focus of much empirical research as we will discuss below. 

In the Dornbusch model it is assumed that prices in goods or labour 
markets are in the short term ‘sticky’ with respect to changes in market 
conditions. This could, for instance, be because of the existence of multi¬ 
period wage or price contracts as in the New Keynesian model. It follows 
from this assumption that purchasing power parity does not hold in the 
short run. Purchasing power parity (PPP) or the ‘law of one price’, 
states that in the absence of transport costs and other transactions costs 
international arbitrage in goods should eliminate differentials between 
the prices of goods in different countries. We discussed in chapter 10 
on the open economy how this would not hold in the short run but 
should hold in the long run, whether traded goods are homogeneous 
across countries or differentiated and imperfectly competitive (PPP in 
the long run but not the short is confirmed by numerous empirical tests, 
e.g. Taylor et ah, 2001). 

Under PPP we would have: 


S t =Pt~ Pt 


(69) 


where p , p F are the logarithms of the domestic and foreign price level. 
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With perfect capital mobility and PPP, from (67) and (69), the equal¬ 
ization of expected real rates of interest immediately follows: 

Rt ~ (Etp t+ i -Pt) = Rt - {EtPt+i - Pt) (70) 

Absence of PPP in the short run means that we may examine the 
behaviour of the real exchange rate, x , defined as: 

x t = S t -pt+Pt (71) 

For simplicity we assume that output (in logs), real interest rates, 
foreign interest rates and the foreign price level (in logs) are fixed and 
normalised to zero; but this does not affect the features of the model on 
which we focus here. The model is given by 


Rt — EtSt +i — St 


(72) 


nit = m t + m = p t — SR t 


(73) 


Pt - p t -1 = k(S t - p t - 1 ) + u t 


(74) 


where mt is a random monetary shock around to, the constant average 
money supply, and u t is a supply shock. Equation (73) is a conventional 
demand for money function equated to money supply. Equation (74) is 
an ad hoc price adjustment mechanism which captures the hypothesis 
that the price level responds sluggishly to changes in the exchange rate, 
via aggregate demand and its effects on the volume of net trade. The 
specification has the convenient property that when k = 1, PPP holds 
instantaneously apart from the current supply shock. 

When PPP holds instantaneously the model can be rearranged to 
give 


_ 6E t St +i ml - u t 

(1 + 6 ) (1 + 6 ) 


(75) 


Equation (75) will be recognised to have the same form as (31) above. 
We solve this model under the assumption that agents have full current 
information. The reduced form for the model is given by 


fern + to* - (1 - k)mt -1 = kS t + u t - 6(E t S t +1 - S t ) + 

6(1 — k){E t -\S t — S t ~\) (76) 


Following the procedures outlined in chapter 2, the solution for the 
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exchange rate in the full-information case is given by 4 

S t = (1 - z)m + zS t - 1 + a 0 m t + aim t ~i + a 2 u t (77) 

where z is the stable root of the equation: 

Sz 2 - z{6{ 1 - k) + k + <5) + <5(1 - k) = 0 (78) 

and cio = fc+6(i- z )+«(i_fc) > a i = ~(1 — ^) a o> a 2 — k+8(i-z) 

We notice from (77) that, in the long run, when S t = S t - 1 , the elas¬ 
ticity of the exchange rate with respect to an increase in the permanent 
level of money supply is unity. In other words, in the long run the ex¬ 
change rate depreciates by the change in m. Given that m is a constant 
(72) implies that R is zero in the long run, and hence (73) that prices 
have the same response to m as the exchange rate. Equation (74) implies 
that in the long run PPP must hold. Leading (77) one period and taking 
expectations we obtain: 


E t S t+1 -S t = (l~z)(m-S t )+a 1 m t (79) 


Equation (77) defines an expectations mechanism known as regressive 
expectations. It informs us that when the equilibrium exchange rate is 
above the current exchange rate, then expectations are revised upwards 
and vice versa. The fact that regressive expectations can by choice of 
the regressive parameter (1 — z) be rational is one implication of this 
Dornbusch model. However, we should note that this property can hold 
in rational expectations models only where there is one stable root; it 
does not hold generally. 5 

We can substitute (77) into (71) and (73) to obtain 


m{ 1 + <5(1 - z)\ +m t - u t - (1 - k)p t ~i 
k + <5(1 — z) 


(80) 


4 For the real interest rate differential to be constant, as investigated by Mishkin 
(1981), an infinitely large intertemporal substitution response is required for either 
a or (3 . The evidence does not support such responses; it is therefore not surprising 
that Mishkin concludes from his reduced-form work that the differential varies over 
time. Similar arguments apply to variation of the real interest rate within a closed 
economy, as investigated by Fama (1975) for the USA; this evidence has now been 
found (by Nelson and Schwert, 1977) to support non-constancy, which again is not 
surprising. We may also note that the size of variation in both the differential and 
the closed (e.g. world) economy level of real interest rates cannot be suggested a 
priori; nor can the length of time to convergence (determined by z in this model as 
influenced by all the parameters). 

5 Relative risk aversion (RRA) is defined as RRA = — = ^ ^ or s P ec ^“ 
cation of the utility function. 
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Equation (80) illustrates another key insight of Dornbusch. 

The impact of a change in the permanent level of money supply (m) 
in the short run, ceteris paribus, is greater than unity except when k = 1 
and consequently greater than the long-run impact. This phenomenon is 
known as ‘overshooting’. The rationale for this effect is that because, in 
the short run, prices are at a point in time sticky or adjusting slowly, the 
only way the money market can remain in equilibrium as the permanent 
level of the money supply is increased is for the interest rate to fall. 
However, a falling interest rate has to be associated with an expected 
appreciation of the currency. Consequently the current exchange rate 
has to depreciate further than its long-run value in order to give rise 
to anticipations of an appreciation as it moves to its ultimate long-run 
value. 

This mechanism is illustrated diagrammatically in figure 14.2. Sup¬ 
pose that at time t = 0 the pound/dollar rate is $1 = £1. At time t = N 
the authorities increase the level of the money supply by 100 per cent. 
In the long run this causes the pound to depreciate against the dollar 
to $1 =£2.0. However, in the short run the pound depreciates further, 
to say $1 = £2.50 and then follows the arrowed path back to long-run 
equilibrium. We notice that along the arrowed path the pound is ap¬ 
preciating, but has always depreciated relative to t = 0. The possibility 
that efficient assets markets, in conjunction with sticky wages or prices, 
could give rise to volatile behaviour of asset prices was a principal insight 
of Dornbusch and has been influential (see e.g. Buiter and Miller, 1981). 



Figure 14.2: Exchange Rate Overshooting 
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We should also note from the solution for the exchange rate that, 
while a positive monetary shock causes a depreciation of the currency, its 
impact can be greater or less than unity, but since m has not changed, 
there will always be an ‘overshoot’ of the long-run equilibrium. The 
Dornbusch result applies to permanent (unanticipated) changes in the 
money stock (to). 

Another important feature of the model is that the exchange rate has 
a predictable path (77). However uncovered arbitrage was assumed in the 
model consequently the asset market is efficient since expected abnormal 
returns (in this example deviations from uncovered interest arbitrage) 
are not predictable. This is another example where predictability of an 
asset price does not violate market efficiency. In particular the exchange 
rate does not follow a random walk, though this is sometimes assumed 
in empirical work. 

A number of authors have attempted to test the Dornbusch model 
(see e.g. Driskill, 1981; Frankel, 1979; Haache and Townsend, 1981; 
Demery, 1984), by examining the properties of reduced-form exchange 
rate equations derived from structural models of the Dornbusch type. 
The empirical results the authors report are unfavourable to the model. 
However there are a number of problems with these tests, the main 
one being that they all assume the regressive form for expectations, 
which is in general incorrect (Minford and Peel, 1983). For example, 
Haache and Townsend (1981) and Frankel (1979, 1982) specify models 
in which lagged adjustment or wealth effects are introduced into the 
demand for money function or lagged adjustment is introduced into the 
interest arbitrage condition; in these specifications, expectations will not 
be regressive. Box 14.1 shows that overshooting also can, but does not 
necessarily, occur in equilibrium models. 


Box 14.1 

EXCHANGE RATE OVERSHOOTING IN EQUI¬ 
LIBRIUM MODELS 

We now examine whether exchange rate overshooting, which occurs 
in the Dornbusch model as a result of disequilibrium in goods or 
labour markets, must always be regarded in a real economy as oc¬ 
curring as a consequence of such features. We will demonstrate that 
this is not the case. Overshooting can indeed occur in equilibrium 
open economy models. In order to explain this, we adopt the model 
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of the previous section. 

In keeping with the equilibrium framework, we assume that all 
agents form expectations on the basis of the same set of macro in¬ 
formation. This we date at t — 1, and for simplicity ignore current 
global information (but this does not alter our point). 

Hence the exchange market equilibrium condition becomes: 

Et-iS t+1 -S t = R t (1) 

and the real interest rate differential, r, is defined as: 

ft = Rt ~ Et-iPt+i + Pt ( 2 ) 

(We have for simplicity set the foreign real interest rate to zero.) 
Demand for money is: 

m t =Pt + yt- A Rt (3) 

Money supply is: 

Amt = e t + A rn t + Av t (4) 

where 

Afrit = u t (5) 

This money supply function now allows not only for (unanticipated) 
temporary changes in the level of money (v) and once-for-all changes 
in the level (e) but also for once-for-all changes in the steady state 
rate of increase (u). It will thus permit us to examine the different 

OO 

reactions to these shocks, (note rn t e t-i + rn t . + v t where 

i =o 

OO 

m t =Yj u t—i) 
i =0 

From the definition of real interest rates and the real exchange rate, 
Xt = S t — Pt, and from (1) we have: 

(Rt — Et-iPt+i + Pt) — Et-i^t+i ~ E t _ip t+ i — (St — Pt) = 

Et-iXt+i - x t (6) 

i.e. the real interest differential must equal the expected real depre¬ 
ciation. 

Now complete the model with an IS and Phillips curve: 


Dt = -ar t + Sx t 


( 7 ) 
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yt = Pn + j(p t - E t ^ipt) + ay t -i (8) 

Equation ( 8 ) has the full classical form discussed in chapter 3. Note 
that neither a nor (3 is infinite, so that r will vary. The model has 
been set up so that y = r = x = 0 in equilibrium. Notice that it 
belongs to the same Mundell-Fleming family as that of chapter 10. 
Define the superscript lue ’ as ‘unanticipated at t-1’; hence for exam¬ 
ple p“ e = pt~E t -iPt- Equations ( 6 )-( 8 ) can be solved as a recursive 
block in terms of p“ e to give: 

X t = pXt-l + 7 T 0 Pt e + (7Tl - pTTo)Ptl i ( 9 ) 

where 7 r 0 = a+ ^ +g , 7 n = <jtt 0 and p is the (assumed 

unique) stable root of the characteristic equation 

'‘ 2 -< 1 + ^)f‘+w^r = ° a") 

ol T p ol T p 

r and y have similar solutions: a first-order moving average in p ue 
and first-order autoregressive coefficient p. From (3) using these, we 
obtain: 

Pt e = qrnT (H) 


7r 0 gm“ e 


where q = r 1 +A+ A 7 ro 1 K a +^)^ Q ] i s greater than 0 and less than 1. The 
nominal exchange rate depreciation is: 

Sr=xr+pr = Q0- + *o)rnT (13) 

where m“ e = e t + u t + Vt 
We can usefully rewrite 


<7(1 +7T 0 ) = 


a + P + 6 + 7 

(o! + P + 5)(1 + A) + q(A + ol + 6) 


which makes it clear that the value is positive and greater or less than 
unity depending on all the impact parameters (however if A + a + 6 
is greater than 1 then it must be less). Notice that overshooting 
properties occur in the broad sense that both the nominal and the 
real exchange rate depreciate in a ‘volatile’ manner in response to 
positive money shocks. 

For the case where the money supply is growing over time we define 
overshooting as a reaction of the nominal exchange rate by a greater 
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proportion than the change in the (current) equilibrium nominal ex¬ 
change rate, i.e. that which would prevail were the present money 
supply difference to be maintained in perpetuity, apart from ele¬ 
ments expected to be reversed (u t ). 

On this basis, we can determine from (13) that: 

1. The exchange rate may respond more than proportionately to 
a once-for-all change in m (e*), and so the equilibrium exchange 
rate. This is the overshooting considered by Dornbusch (1976), 
which deals with surprise shifts in the permanent level of m. 

2. It also may respond more than proportionately to a rise in 
m which is due to a permanent rise in its growth rate ( u ), 
However, since there is no way that speculators can distinguish 
between e and u shocks when they occur, the reactions to both 
are the same. 

3. It also responds to a temporary change in m (v), which on 
our definition does not change the equilibrium exchange rate. 
This is also a form of overshooting, though not that dealt with 
by Dornbusch. 

All these types of overshooting in response to monetary shocks are 
qualitatively the same as those in the ‘sticky price’ models of Dorn¬ 
busch and Frankel, yet they emerge from an equilibrium model. By 
altering our assumptions about the availability of current global in¬ 
formation, these results could be easily ‘enriched’ to give a variety 
of potential overshooting responses; substantial overshooting is ex¬ 
hibited in empirical application by an equilibrium model of the UK 
economy (Minford, 1980, the Liverpool model). To sum up, volatility 
of the nominal exchange rate (overshooting), as well as of the real 
exchange rate and real interest differentials, is not prima facie evi¬ 
dence of ‘price stickiness’, ‘disequilibrium’ or ‘inefficiency’ in goods 
or labour markets. 

Distinguishing Equilibrium from Disequilibrium Models of the Ex¬ 
change Rate? 

For good measure we can show that, on the basis of a reduced form 
exchange rate equation on its own, it is not possible to determine 
whether it comes from an equilibrium or disequilibrium model. For 
this purpose, we set up two models identical in all respects except 
in their ‘supply’ behaviour. 

The first model is the one just dealt with; it consists of the demand 
for money function (3), money supply function (4), efficient market 
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condition (1), and IS curve (7) and its supply curve is an equilibrium 
one (8). The second model consists of the same equations apart from 
(8) where it has a sticky price Phillips curve like Frankel’s, namely: 

Pt - pt-i = bx t -i + Amt (15) 


(We also assume in the spirit of disequilibrium models that specu¬ 
lators have full current information, and condition the expectations 
operator throughout on the basis of current information.) It turns 
out that the solution for Xt in this model is: 


Xt — (A 2%t—l 


[e t + A u t + (1 — Pi 1 ){vt — t>t-i)] 
M 2 (A + a ) 


(16) 


where /x 2 is the stable, p 1 the unstable root of the characteristic 
equation: 


L \ + a' A + a 

Compare this to the solution for the equilibrium model: 

X t = X t -1 + 7T 0 <?m“ e + q( 7Tl - 7T 0 )m“f! 


(17) 


(18) 


This shows that it is not possible to distinguish between the equi¬ 
librium and disequilibrium models on the basis of the reduced-form 
(real) exchange rate equations alone; both are ARMA(1,0,1) time- 
series models (see Time Series Annex). It follows that the models 
can only be distinguished, if at all, on the basis of full structural 
estimation. This is another example of ‘observational equivalence’ 
(see chapter 15). 


EMPIRICAL EVIDENCE ON MARKET EFFICIENCY 

Our interest here is in the empirical evidence that testing for market 
efficiency sheds on the rational expectations hypothesis. In these tests, 
either part of the joint hypothesis may fail: the model of equilibrium 
expected returns or the RE hypothesis. But this is unavoidable in testing 
a hypothesis about expectations which are not directly observable. 

Modelling of the equilibrium expected return is clearly crucial in em¬ 
pirical tests of market efficiency. As the examples above have illustrated 
asset prices or returns can have predictable patterns without necessarily 
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violating the efficient markets hypothesis. The crucial element is that 
abnormal returns should not be systematically predictable. 

Since a property of rational expectations is that any difference of 
outcome from the expected outcome is unforecastable from available 
information, we have that : 

Rit = EtRit + i>it (81) 

so that vn is independent of EtRit, the rational expectation. Substitut¬ 
ing the determinants of E t RitRom CAPM, for example, gives: 

Rit = Rt + Qit. + Vit (82) 

where q,t is the risk premium. If q lt were constant (82) can be esti¬ 
mated by ordinary least squares (with the coefficient on R t constrained 
to unity), and the estimated error term, zy), should be independent of all 
information available at the beginning of period t: not merely past zyj 
(weak-form), but also all relevant data (such as money supply, inflation 
and growth). This is known as an orthogonality test. It is also possible 
to estimate (81) freely, in which case the coefficient on R t should not be 
significantly different from unity, a further check on the joint hypothe¬ 
sis: the other tests apply as before. A further implication of (81) is that 
any trading rule, TR (a systematic rule for trading assets), which uses 
information at the start of t, including past zy), to buy and sell asset i, 
intending to make profits because 

E[Ru — EtRit} | TR > 0 (83) 

must fail under the efficiency hypothesis since by (81) 

E[Rn — EtRit] = 0 (84) 

Since under the efficiency assumption any trading strategy has an 
expected abnormal return of zero it must do worse, given the increased 
transactions costs associated with an active rule compared with a trading 
strategy of buy and hold (do nothing). The returns to each rule differ 
by expected transactions costs. 

An important class of models in which efficiency prevails (abnormal 
returns are unforecastable) is that of martingales. Formally if a variable 
Z t is described by the stochastic process 

E[Zt +1 | Zt%t- 1, ■ • • = Zt (85) 


or 


E[Zt +1 — Z t | Z t 'Zt- 1 ,... ] — 0 


(86) 
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it is known as the martingale property. 
The process 


Zt+1 — Zt + Ut +1 (87) 

is therefore a martingale. The error term has the property that E t Ut+ 1 = 
0. The error can exhibit structure such as time varying heteroskedasticity 
(e.g. ARCH effects - Time-Series Annex). If a constant term is added 
to (87) it is known as a submartingale. The terminology of random 
walk and martingale are often interchanged. In fact a random walk as 
conventionally defined is a stronger concept in that the error term is 
assumed to be independently and identically distributed (iid). 

So the process 

Zt+ i = Zt + ct + Ut+ i (88) 

where a is a constant and E t ut +1 = 0, is a submartingale and a random 
walk if Ut -|-i is iid. 


THE RATIONALE FOR MARKET INEFFICIENCY 

Broadly the implicit mechanism involved for the market to be efficient 
in the sense of Fama is that if asset markets were not efficiently aggre¬ 
gating and processing information the disparity between fundamental 
values and market prices would present traders with profit opportuni¬ 
ties. Rational speculators it is argued will, essentially instantaneously, 
drive asset prices back to their fundamental values. 

The case for market inefficiency rests on either or both of the pre¬ 
misses that prices of assets move when there is no new information con¬ 
cerning fundamentals and that the action of speculators may not move 
asset prices towards their fundamental values. 

Some empirical observations that are offered as consistent with these 
premisses are market crashes such as October 19th 1987 when world 
markets fell around 20 per cent without apparently any important news 
being evident which could account for the fall, or sustained rises in prices, 
known as bubbles, that are seemingly inexplicable in terms of market 
fundamentals. 

Also it is observed that many trading decisions are based on past 
prices. Chartism, the extrapolation of past prices, is widely employed as 
the basis for trading rules. In addition, there is extensive use of stop-loss 
orders, whereby an asset is sold if its price falls by a pre-specified amount 
as well as the growth of dynamic hedging strategies such as portfolio 
insurance where investors buy (sell) into rising (falling) markets. Whilst 
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none of these activities necessarily imply market inefficiency they raise 
the question as to whether informed rational speculators are arbitraging 
out the trading rules of any uninformed or irrational traders; if they are 
not, then inefficiency may be present. 

The first theoretical model of inefficiency we examine is that of ra¬ 
tional bubbles. 


SPECULATIVE BUBBLES 


History is replete with examples where asset prices have exhibited dra¬ 
matic increases then falls which it is argued are not readily explained 
by movements in the ‘fundamentals’ of the asset such as the expected 
future dividend stream. One example is the South Sea bubble in the UK: 
in the 18tli century the stock of a company which traded in the South 
Sea experienced exponential price increases before plummeting in value. 
Another is the tulip bulb episode in Holland in the late sixteenth century 
where (single) tulip bulbs were exchanged for land and gold before tulips 
ultimately became near worthless (though see Garber, 1989). 

It is argued that these episodes represent speculative bubbles in which 
the anticipation of future capital gains leads to spiralling upward price 
movements before the bubble eventually collapses or pops and the price 
exhibits dramatic falls. Since the 1980s there has been a considerable 
amount of theoretical and empirical work on speculative bubbles. Our 
purpose in this section is to provide an introduction to this literature; 
our discussion is related to our earlier one in chapter 2 where we showed 
that rational expectations with expectations of future variables can have 
bubbles in their solution. 

We assume for simplicity that investors are risk neutral so that via 
arbitrage expected returns are equal to those on a riskless asset with 
rate of return R, assumed constant, so that E t Rt,+ 1 = R. Given these 
assumptions we obtain (31) above, which we reproduce for convenience: 


Pt = 


a + R) 


=-E t P t+1 + 


(1 + R) + 


(89) 


Assuming rational expectations and solving this model forwards N 
periods as in 33 we obtain 


Pt = Et 


r n 


r 


Dt+i 


EtPt+N 
(1 + RY 


(90) 


The second term in (90) is the discounted value of the stock price 
N periods in the future. In the absence of bubbles as we let N go to 
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infinity we assume that this term goes to zero. If so, the current asset 
price is equal to the expected value of the stream of dividends into the 
indefinite future: this expression is the fundamental of the process, F t . 

The idea of speculative bubbles is that equation (90) is also consistent 
with rational expectations solutions other than the fundamental solution. 
If we try the solution 


Pt = F t + B t 

in (89) we find that as long as B t follows the process 

(1 + R)B t = E t B t+ 1 


(91) 

(92) 


it is a valid mathematical solution to (89). 
By substitution of (91) in (89) we obtain 


/) — Ft F Bt —- EtBt+\ T - EtFtM\ F - EtDt-\-i 

(1 + R) + (1 + R) + ( 1 FR) 


(92) is consistent with this and the solution form for 

Ft — - EfFt-i-i +--— EtDt-\-\ 

(1 FR) (1 FR) + 


is the same as (90) where tends to zero as N tends to oo. In 

other words if agents believe that the process B t is driving asset prices 
then it will be a “rational” solution, a “self-fulfilling prophecy”; notice 
that it implies E t Pt+N = E t F t +N F E t Bt+N which explodes endlessly, 
so that the bubble cannot be expected to burst ever. 

As a consequence of the rational expectations assumption 


B t+ 1 = E t B t+ 1 + e t+ i (93) 

where e *+i is a random error. 

Substitution of (92) into (93) gives 

Bt+i = (1 + R)B t F et+i (94) 

(94) makes clear that the deterministic solution for B is an asymp¬ 
totically explosive process. It is therefore important that there be no 
transversaility condition putting a limit on this process; such a condi¬ 
tion was how we ruled out bubbles in chapter 2. 

We can also write the solution for the bubble in the form (see Salge, 
1997) 


(95) 
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where a = 


1 

U+K) 


and 


EtMt +1 — M t 


(96) 


implying that M t is a martingale process. 

Leading (95) one period and taking expectations, noting that is a 
deterministic process, we obtain 


E t B t +1 = 


EtMt+i 

a t+1 


from (96) . 


c/Bf — Bt 
a t+1 a 


(97) 


Consequently a bubble contains a martingale component and any 
process for M t satisfying this condition is a valid rational bubble. 

A general form of a stochastic martingale process is given by 


Mf — Pt^t- i + u t v t 


(98) 


where the random variable p t , has conditional expectation that E[p t+l \ 
It] = 1. Other assumptions are that E{v t +i} = 0, i = 1 — n, , E[p t M t \ 
It] = 0, E[p t v t | It] = 0, E[p t u t | I t ] = 0, E[u t M t \ I t ] = 0, E[u t v t j 
It] = 0, E[v t M t | I t ] = 0. 

These assumptions establish the independence of all stochastic vari¬ 
ables in all leads and lags. No restricting assumptions are required con¬ 
cerning the nature of the random variable u t . 

Suppose lit = 1; define a random variable ht that is normally dis¬ 
tributed with mean p and variance o\. We can then exploit the property 
of conditional log normality and define p t for any arbitrary constant A 
as 


Pt= e 


Xh t 




(99) 


because 


E e Xhi = Ee Xht+1 = e x > 1 +—T J 


( 100 ) 


Taking expectations of (99) the condition Ep t+1 = 1 is satisfied. 
Consequently we can write the martingale process of (98) as: 


M t = e 




M t -1 + v t 


( 101 ) 


Substitution of (101) into (95) gives the implied bubble as: 


B t = 


Xht — \ A/iH-K-k-+lnc»! 


Bt -1 + e* 


( 102 ) 


where e t = ^ and e lo s“ = 1. 

L a. T a. 
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(E t et+i = 0 for i > 1. Note substitution of M t from (95) into (98) 
(with u t = 1) gives B t = a~ 1 p t B t -1 + employing the defintion of p t 
from (99) we have (102)). 

Several different types of bubbles can be obtained exploiting the in¬ 
sight that rational bubbles include a martingale process. In fact any 
process that follows a martingale process can be included in the bubble. 

For example suppose the fundamental can be described by a martin¬ 
gale process 

F t =F t _ 1 + 9 t (103) 


where 9t is a random variable. 

In this case we can let M t be: 

F t 

M t = F t so that the bubble is Bt = — (104) 

or 

For more general specifications of fundamentals the method is to 
substitute appropriate random variables for h t . For example let us add 
a constant component to fundamentals so that they follow the process 


Ft — F t - 1 + p + r/ t (105) 

where r/ t is IV(0, afj. In order to find the bubble that corresponds to 
this process we inspect the general bubble formulation (102). We can 
substitute for the variable h t in this formulation so long as the process 
we substitute in has a mean of p and a variance because for these 
parameter values we can obtain the martingale process (101). F t — F t -\ 
in (105) has a constant mean (p) and variance a\ so we can substitute 
F t — F t -1 for h t into the general bubble formulation (102) (with e t = 0 
assumed zero here for simplicity) to obtain 


B t 

B t -1 


A(F t -F t _i)- 

= eL 


A fi-\ -2"-^ L +l n ot 


( X 2 cr 2 \ 

A/iH-j-^+ln aj t 


AFt-i- 

e 


A/^H-j-^+ln ot 


it- 1 ) 


(106) 


(106) implies that the bubble process in fundamentals corresponding 
to the fundamental process (105) is given by 

Bt = e h-(^ +b “)'] = e [AF t -(AM+^)t-lna‘] 


If fundamentals follow the geometric process 


In Ft - In Ft-! = p + p t 


(108) 
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where rj t = N( 0, cr\), we obtain following the above procedure 


B 


Bt A(lnF f —lnF f _i)-( A/J+ - 7 h +ln t 

= eL 


t -1 


A In F t — ( Xfi-\ -+ln a ) t 


i A^er^ \ 

AlnF f _i- A-j-^+lna ) (t-1) 


which implies 


A In F t — [ AuH-rj-^+lnalt . 

B f = e V ) =F x e 




(109) 


( 110 ) 


(note e AlnF * = F x ). We observe that if A^ A 2 CT> ‘ + lna = 0, then the 

bubble process is given by 


= F a (111) 

Alternatively if A = 0 

= 4 ( n2 ) 

a' 


When a bubble depends on its own value in the previous period it 
is called a Markovian bubble. When the bubble depends on fundamen¬ 
tals it is called an intrinsic bubble (Froot and Obstfeld, 1991). When 
the bubble depends on arbitrary processes it is called an extrinsic or 
extraneous bubble.The solution for extraneous bubbles follows the same 
procedure as for intrinsic bubbles. For instance if the extraneous pro¬ 
cess, S t: follows a martingale process then the rational bubble is given 
by B t = |if. Salge (1997) demonstrates how to solve for bubbles for the 
general ARMA specification of fundamentals or extraneous variables. 
Depending upon the specification of A a variety of different bubble pro¬ 
cesses are feasible. 

If bubbles were non-stochastic, et+i = 0, then the solution to (92) is 
simply 


B t = B 0 (l + R) t 


(113) 


so that the solution for the price would simply embody a determinis¬ 
tic explosive component and would in principle be readily amenable to 
statistical tests. 

Whilst the possibility of such deterministic bubbles have been inves¬ 
tigated empirically for periods of hyperinflation before monetary reform 
(Flood and Garber, 1980), it would appear that bubbles of a determin¬ 
istic form are not features of asset or other prices. As a consequence 



372 


Confronting Models with Facts 


bubbles would not apear to be plausible empirically unless there is a 
significant probability that they will collapse after reaching high levels. 

Blanchard (1979) and Blanchard and Watson (1982) proposed a prob¬ 
abilistic bubble that can embody this feature. Essentially there are two 
regimes which occur with constant probabilities q and 1 — q. In the first 
state (A) the bubble survives with probability q and continues to in¬ 
crease at an expected rate of E t B t+ i/A = (l +^ fi/ . In the second state 
( C ), with probability 1 — < 7 , the bubble collapses so that E t B t+ i/C = 0. 

Adding stochastic terms we have 


Bt +1 = 


(1 + R)B t 

q 


+ 64+1 with probability q in state A 


and: 


(114) 


B t+ 1 = e t+ i with probability (1 — q) in state C (115) 


where the error has the property that E t e t +1 = 0 As a consequence we 
have 

E t B t+ 1 = q + (1 - <?)0 (116) 

We note from (116) that the bubble has the form of equation (92) and 
is therefore a valid rational expectations solution. In addition we observe 
that in state A the bubble grows at a faster rate on average (1) than the 
non-probabilistic bubble in order to compensate for the probability of 
collapse. Collapsing bubbles are consistent with the analysis employing 
martingales; we simply define the process 

M t = —M t ~ 1 +u t Vt with probability 7 r (117) 

7r 

and 

Al t = UtVt with probability 1 — 7 r (118) 


We can also derive the expected excess return, which is defined as 
the return on the asset which incorporates the bubble minus the rate of 
return on the riskless asset in regime A and C. Defining the expected 
excess return at the end of period one as W+i = E t Rt+i — R, then from 
(89) 

X t+1 = E tPt+i+EtD t+ i _ (1 + ^ ( 119 ) 

Cs 

Taking expectations at time t of (91) at time t + 1 we obtain 

EtPt+i = E t F t +1 + E t Bt +1 (120) 
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where 

E t F t+ 1 = E t 
now 


E, 


t+i 


Dt+i +1 


ti ( 1 +^ 


£ 


EtFt+1 + EfD t +i — E t 


E>t+i +1 


F I \ " A+i+i 

‘lit (!+^ 


Et.Dt+i — 


( 121 ) 


S(1 + RY _ 

(l+^)F t = (l + R)(P t -B t ) (122) 
so substituting (122) and (120) into (119) we obtain 
(1 + R)(P t - B t ) + E t B t+1 


X t+1 = 


(l + R) 


(123) 


In regime A we substitue for E t B t+ i from (116) into (123) to obtain 
expected excess returns as 


X t+1 = 


(1 + R)B t { 1 — q) 
qPt 


(124) 


In regime C we substitute for E t B t+ i from (116) into (123) to obtain 
expected returns as 


Xt+i 


(1 + R)B t 
Pt 


(125) 


We observe from (124) and (125) that expected excess returns differ 
substantially in the two periods. However expected excess returns are 
zero across the two regimes. Rational bubbles do not create a predictable 
pattern in excess returns; rather they create volatility in prices. 

Whilst the the above probabilistic bubble has the property of collaps¬ 
ing, Diba and Grossman (1988) show that the impossibility of negative 
rational bubbles in stock markets (because of the implication of a neg¬ 
ative expected asset price in the future which violates limited liability) 
implies that a bubble once collapsed can never restart. Essentially if 
the bubble takes a zero value, then from (92) its expected value in the 
future is zero. Because negative values are ruled out for stock prices this 
implies that the average of the positive values must be zero, but this is 
a contradiction, so it can only be zero if it takes the value of zero in all 
future periods. This point also carries the implication that if a bubble 
exists today it must have always existed, i.e. since the moment trading 
began. 

Evans (1991) and Van Norden (1996) have formulated processes for 
bubbles that meet this theoretical point. Van Norden allows for the 
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possibility that the bubble is expected to collapse only partially in state 
C and that the probability of a bubble’s continued growth falls as the 
bubble grows 

Van Norden specifies that the probability of the bubble’s continued 
growth is given by 

q t = q(B t ) with jrjQ < 0 ( 126 ) 

(the absolute value of the bubble to allow for negative bubbles in markets 
such as exchange rates.). 

In regime C the bubble is expected to collapse only partially and is 
given by 

E t B t +1 = u(B t ) in state C with probability q t (127) 

where u(.) is a continuous and everywhere differentiable function such 
that u(0) = 0 and 1 > > 0. 

In state A the bubble is expected to grow at rate 

EtBt +i = {1+ ( n\ Bt - [1 ~ q{ f^ {Bt) with probability 1 - q t (128) 
q{B t ) q(B t ) 

It is easy to verify that (128) and (127) imply equation (92). We can 
deduce from (128) that the expected value of the bubble in the surviving 
state is a decreasing function of the probability of survival q{B t ). As a 
consequence, as with the Blanchard bubble, the greater the probability 
of collapse the larger is the expected gain on a surviving bubble in order 
to compensate the investor for the possibility of collapse. 

Van Norden’s specification of the bubble process is interesting in 
demonstrating how the probability of collapse can be readily endogenised, 
and also how the process can be modified so that the bubble is not nec¬ 
essarily zero in the collapsed state. Both these features could be relevant 
in empirical work on detecting bubbles. 

Evans formulates a bubble that is always positive but nevertheless 
periodically collapses. The Evans bubble takes the form 

Bt +i — (1 + R)BtUt- i-i if Bt < k (129) 


and 


Bt +i — 


0 t+1 (l + i?)[S t -(l + i?)-M] 


u t + 1 if B t > k 


(130) 


where k and 6 are positive parameters with 0 < 6 < (1 + R)k and u t +i 
is an exogenous independently and identically distributed (iid) positive 



Testing the Rational Expectations Hypothesis 


375 


random variable with E t Ut +1 = 1. Ot+i is an exogenous i.id Bernoulli 
process( independent of u) which takes the value unity with probability 
q and zero with probability 1 — q. 

Taking expectations at t of (129) we observe that the bubble is equa¬ 
tion (92) when it is in the regime where B t < k. When B t > k the bubble 
has the expected value of 


EtJ3 t +i = 


L , (1 T R)[B t - (IT i?)' 1 #]] 


[(IT R)B t <%l-c?)l 

o -\- 

l Q \ 


9 9 


with probability q (131) 


and 


E t B t+ 1 = 6 with probability 1 — q 


(132) 


so that when B t > k 


E t B t+1 = q 


(1 + R)B t 6(1-q) 


+ (1 — q)6 — (IT B)Bf (133) 


Consequently the bubble satisfies condition (92). 

The Evans bubble has the property that when B t < k it grows at 
mean rate IT R. When B t > k the bubble ‘erupts’ and grows at a faster 
mean rate as long as the process continues. When the bubble collapses 
it falls to a mean value of 6 and the bubble process begins again. Evans 
notes that by varying the parameters 6, k and q one can create bubbles 
where the frequency with which the bubble erupts, the average length 
of time before collapse and the scale of the bubble vary. An example of 
an Evans bubble is depicted in Figure 14.3. 

An important feature of the Evans bubble is that standard empirical 
tests for bubbles based on unit root and cointegration methods (Time- 
Series Annex) may not detect bubbles when they are present. Evans 
simulated bubbles and applied unit root tests to the simulated bubbles. 
Even though bubbles are asymptotically explosive for q < 0.75 for more 
than 90% of the simulated bubbles of length 100 observations, statistical 
tests for unit roots rejected the hypothesis of a bubble in favour of a 
stable alternative. 

More recent empirical work has endeavoured to apply alternative 
econometric approachs such as switching regime regression models to as¬ 
certain the presence of bubbles (Hamilton, 1994; Van Norden 1996; Van 
Norden and Vigfusson, 1998) given that the Evans-type model describes 
behaviour for different regimes. These statistical methods appear to offer 
greater promise of detecting bubbles. As yet however it is probably fair 
to say that the different empirical methods which have been applied to a 
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Bubble Component B for Advanced Simulation 


Figure 14.3: Example of an Evans Bubble 


variety of different asset markets including stock and exchange markets 
have not generated any consistent evidence of speculative bubbles. 

It should be noted that theoretical work by Tirole (1982, 1985) has 
demonstrated that bubbles cannot arise in certain models. For example 
in an overlapping generations model with an infinite number of finite- 
lived representative agents, Tirole shows that when the interest rate ex¬ 
ceeds the growth rate in the economy (so that it is dynamically efficient) 
a bubble would ultimately violate some agents’ budget constraints. It 
would appear that bubbles could only exist in economies which are dy¬ 
namically inefficient if the model is of the representative-agent type and 
agents are rational. From this perspective Froot and Obstfeld (1991) 
suggest interpreting empirical tests for bubbles as tests of the rational 
expectations assumption. 

We can put this another way. RE models are, generally, ‘over¬ 
identified’. That is, everything in the reduced form is derived from 
structural relationships and more restrictions exist than reduced-form 
parameters; the reduced-form errors are combinations of the structural 
parameters and structural errors. Errors such as bubbles are not present 
in the structural model and hence should not appear in the reduced form. 
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Only fundamentals are included in it. 

Should we write down a structural model in which some bubble vari¬ 
able is allowed to enter, then we must ensure that it satisfies both the 
transversality conditions on the model (including private and budget 
constraints as they tend to infinity) and the optimality conditions. For 
example, as we showed in chapter 2, deterministic explosive bubbles in 
prices will violate reasonable government limits on inflation and so will 
be ruled out from the start. In the Blanchard and Watson example re¬ 
viewed above, of a stochastic bubble, at least one of the paths must have 
a positive probability of continuing indefinitely for the bubble even to 
begin. Is such a path consistent with such limits? Above we noted the 
objections of Tirole to bubbles in one context, and the caveats expressed 
by Obstfeld and Rogoff. In each case of a bubble we must ask whether 
theory as above permits it. 

When we turn to empirical testing, we can note that the ‘forward 
solution’ of the model discussed in chapter 2 has the form of a bubble. 
One can for example write down a model of prices in which a large 
positive future shock (to the money supply, say) is expected with some 
probability; as the period approaches the price level rises until at the 
period it either jumps upwards if the shock occurs or collapses if it does 
not. Distinguishing this non-bubble solution from a supposed bubble 
clearly is difficult. Remember a bubble relates to what people in markets 
expect for the future; this could well relate to an expectation about a 
fundamental. 

The difficulty with bubbles is therefore both theoretical and empir¬ 
ical. This critique of bubbles within rational expectations models does 
not of course extend to models where expectations are not rational; here, 
by construction, people may believe in curiosa - ‘fads’ and so forth. To 
such models we now turn. 

Fads and Noise Traders 

Shiller (1984), (1989) and (1997) has emphasised the importance of mass 
psychology in financial markets with the implication that investors may 
exhibit fashions or fads. A fad is a depature from fundamental values 
due to ‘psychologically’ induced changes in market sentiment. We first 
illustrate the implications following the model in Faina and French (1988) 
and Cutler, Poterba and Summers (1990, 1991). 

Let 


Pt = F t + et 


(134) 


where Pt is the return or price of an asset, F t is the fundamental value 
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and e< is a fad. Fundamentals are assumed to follow the process 

F t = F t _\ + h t (135) 

The fad is described by the stationary process 

e t =pe t -i+v t (136) 

where p is a positive constant less than unity and v t is a random error. 

Cutler, Poterba and Summers (1991) consider the case where a proxy, 
F^, is available to measure fundamentals, given that in applied work we 
typically do not measure the fundamentals with precision or are employ¬ 
ing approximations to the theoretical model. Let 

F t =F t *+w t (137) 

where w t is the measurement error assumed to be random. 

Differencing (134) and substituting from (135) and (136) we obtain 
the true reduced form for A P t as 


AP t — (p — l)(Pt_i — Ft-1 ) + ht + Vt (138) 

Consider the regression estimate of /3 in the relationship 

A P t =a + 0(P t - 1 -F t *_ 1 )+Ot (139) 

The regression coefficient will have a probability limit of 

(p-VCovjPt-! - - F;_t) 

var(P t -1 - F*_t) 

_ {p - l)Cov(e t -i, e t -i + w t - i) 

Var(e t -i + w t -i) 

= Jp-lWe) 
var(e) + var(w) 

The implication is that changes in asset prices will be predicted by 
lagged deviations of the price from its proxy fundamental (for which in 
their empirical work Cutler, Poterba and Summers (1991) employ the 
log of the real dividend). In addition prices or returns can be shown to 
exhibit a predictable pattern. 

Of course predictable patterns in the difference between prices and 
fundamentals raises the issue of why rational speculators do not arbitrage 
the process. It is implicitly assumed that they face some sort of liquidity 
constraint preventing them from dominating the market. However, it is 
also possible that they do so arbitrage, so that the fad occurs but is then 
removed from the data next period. This would correspond to p = 0 
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in our example. In this case, prices could differ from their fundamental 
value so implying inefficiency, and yet the difference between the price 
and fundamentals could still be unpredictable. This distinction points to 
a caveat in the interpretation of efficiency given by Fama; Sliiller makes 
the point that the absence of predictability does not necessarily imply 
market efficiency. Clearly the existence of systematic abnormal profits is 
sufficient to invalidate efficiency. However absence of predictable abnor¬ 
mal returns could be consistent with market inefficiency if prices were 
more volatile than was consistent with the volatility in the fundamental. 
Bubbles or some fad processes are consistent with this interpretation 
of inefficiency. Efficiency therefore requires the complete elimination of 
fads from the price process (e* = 0); otherwise a trading rule operating 
on P t = F t would make money (this is a particularly simple rule, viz. of 
current arbitrage). 

Other models of market inefficiency assume the coexistence of hetero¬ 
geneous traders. In these models there are smart traders, the rational 
speculators, as well as the noise traders. The noise trader trades on 
fads or charts or other extraneous information (‘noise’). The key re¬ 
quirement of the models is that arbitrage activity by the smart traders 
is limited. This can be a consequence of the assumption of risk-averse 
rational speculators combined with a micro-structure rationale where 
for example arbitrageurs have limited capacity to borrow funds due to 
signalling problems (see e.g. Shleifer and Vishny, 1997). 

The consequence is that arbitrage activity by the rational traders 
may not eliminate the influence of the noise traders from the outcome 
for returns or prices so that the market is inefficient in aggregate (see 
e.g. Figlewski, 1979). 

Other specifications of this type of model have the informed traders as 
not holding rational expectations but rather forming their expectations 
on the basis of a relatively well-informed rule such as PPP deviations 
for exchange rates (section below) or price-dividend deviations for stock 
prices using say the Gordon (1962) model. If it is assumed that the pro¬ 
portion of the two types of agent in the market varies with the extent of 
the deviation of the asset price from its long run value, then these mod¬ 
els can exhibit complex nonlinear dynamics of the asset price including 
chaotic outcomes (Time-Series Annex; and e.g. De Grauwe, Dewachter 
and Embrechts, 1993). 

Some recent models of asset price determination investigate more for¬ 
mally the micro-structure determining asset prices. An important class 
of models consider the implications of assuming that traders do not be¬ 
have competitively and take prices as given. The models are solved using 
the game-theoretic Bayesian Nash equilibrium concept where the strate- 
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gies of other traders rather than prices are taken as given. These models 
allow analysis of strategic interactions in which traders take their price 
impact into account (see Brunnermeier, 1999, for a survey). In essence 
each trader recognises that large trades will move prices against him. 
The models incorporate features such as asymmetric information, noise 
traders, speculators, market-makers and explicit assumptions concerning 
the mechanism by which traders submit orders to trade. They produce 
many interesting insights. 

When it is assumed that traders have short investment horizons, per¬ 
haps due to financial constraints which make arbitrage activities cheaper 
for short-term assets, herd behaviour can result (see e.g. Froot, Scharf- 
stein and Stein, 1992). In addition strategic interaction can give rise to 
rational profit-maximising speculators, accentuating asset price volatility 
caused by noise traders as they strategically exploit feedback elements 
such as herding (see e.g. De Long, Shleifer, Summers and Waldman, 
1991). 

The models can also generate market crashes as in e.g. Romer (1993) 
where information is heterogeneous and investors are uncertain about the 
quality of information other investors have. 

Although this literature is rich in insight and can provide a rationale 
for market inefficiency, it remains an empirical issue whether efficiency 
as defined by Fama is violated. We now turn to some empirical tests. 


EMPIRICAL TESTS OF MARKET EFFICIENCY 

There has been an enormous amount of empirical work which has ex¬ 
amined the efficiency of asset markets including gambling markets (see 
e.g. Sauer, 1998; Thaler and Ziemba, 1987). Violations of the efficient 
markets hypothesis are typically referred to as anomalies. The anoma¬ 
lies reported may constitute evidence of abnormal returns or violations 
of the rational expectations assumption. Before considering some of this 
evidence it is important to note that the dangers of data-driven infer¬ 
ence or data-snooping whilst always a consideration in applied analysis 
(see Learner, 1978, for a discussion of pretest bias) needs to be given 
formal weight when interpreting some anomalies as demonstrated by 
Sullivan, Timmermann and White (1999, 2001). In the limited sample 
sizes typically encountered in studies, significant relationships or sytern- 
atic patterns are bound to occur if the data are analyzed with sufficient 
intensity. For example the orthogonality property of rational expecta¬ 
tions will be rejected by the usual statistical criteria five percent of the 
time. Clearly if authors are searching in the universe of variables and 
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significant rejections are reported it could appear that the hypothesis is 
invalid even though it is true. 

An apparent rejection of the market efficiency hypothesis which is 
data-driven is the finding reported by numerous researchers that stock 
returns exhibit seasonal regularities (see Sullivan, Timmermann and 
White, 2001, for key references). Abnormal returns are related to day of 
the week, week of the month, month of the year, holidays etc. If these 
results have validity then the efficient markets hypothesis is in serious 
trouble given it is ‘such a simple violation’. 

The issue is one of ‘data-snooping’ — in other words, researchers 
cannot avoid being influenced by their sample of data. They ‘find’ rules 
that are in that set of data by repeated trials. This will occur even 
when some data is kept aside - because this data too is known about to 
some degree and trials on it may lead to respecification of the rule on 
the used data before a renewed trial. To control for this problem the 
researcher must not allow the data to influence his choice of rule. Thus 
all possible rules must be given an equal chance to work on the data; 
some of course will work well for particular samples even if they do not 
in general work at all. By doing this exercise for all rules for repeated 
samples one may calculate the probability of a rule ‘doing well’ in a given 
sample. Clearly for a rule to be considered significant it must do better 
than this. This exercise has been carried out by Sullivan, Timmermann 
and White (2001) for calendar effects. 

They construct a universe of calendar trading rules, using permuta- 
tional arguments that do not bias in favour or against particular calendar 
rules.They consider some 9500 calendar effects. They imagine that this 
set of calendar rules was not inspected by any one individual investor 
or researcher. Rather the search for calendar rules has operated sequen¬ 
tially across the investment community or researchers with the results 
of investors or researchers being reported through the survival of the 
‘fittest’ calendar rules. Their findings are striking and important. They 
find that although many calendar rules generate abnormal returns that 
are highly significant when considered in isolation, once the dependencies 
operating across different calendar rules are allowed for, then the best 
calendar rule is no longer significant. In addition they consider a smaller 
number of 244 rules that remove any doubts that irrelevant rules are 
pooled with‘genuine’ rules in the 9500 experiment. Their results are the 
same. The apparent statistical significance of the best calendar effects 
is not robust to data-snooping effects. 
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Weak-form efficiency tests 

Weak-form efficiency in the sense of Fama implies that the expected 
returns from a trading rule based on past prices should be less than those 
generated by a buy-and-lrold strategy equal to the number of trades times 
the transactions costs. In fact according to Taylor and Allen (1992) and 
Brock, Lakonisliok and LeBaron (1992) chartist or technical techniques 
are widely employed in asset markets as either a basis for published 
technical commentary or for direct use. The term technical analysis or 
chartism is given as a generic title for any trading rule based on the past 
history of prices or returns. Essentially more or less sophisticated rules 
are employed to extrapolate past changes in asset prices or returns. One 
common example is to employ two moving averages of past returns one 
based on a short horizon, say 1 to 10 days, and the other on a long moving 
average, say 150 to 500 days. When the short moving average rises above 
(or falls below) the long moving average this is a signal to buy (or sell). 
A variant of this procedure is to modify the rule by introduction of a 
band around the moving average to eliminate ‘whiplash’ signals when 
the two moving averages are close. 

There is a plethora of empirical work that has investigated the efficacy 
of technical analysis — e.g. Alexander (1961); Fama and Blurne (1966); 
Taylor and Allen (1992); Brock, Lakonisliok and LeBaron (1992); Neely, 
Weller and Dittmar (1997); Sullivan, Timmermann and White (1999). 

Whilst many rules developed by chartists have been reported to gen¬ 
erate abnormal returns, and some theoretical analysis — e.g. Brown 
and Jennings (1989); Blume, Easley and O’Hara (1994) — demonstrates 
that technical analysis can have value particularly for small less widely- 
followed assets, the empirical results of Sullivan, Timmermann and White 
(1999) appear to be of importance in the interpretation of chartism. Fol¬ 
lowing a similar methodolgy to the one outlined above they construct 
7846 parameterizations of trading rules which are applied to the Dow 
Jones Industrial Average over the full period from 1897 tol986 as well 
as four subperiods. The period 1987-1996 is employed to evaluate the 
rules on a hold-out sample. Their idea is to develop a test statistic 
which evaluates the performance of a chartist rule relative to the full set 
of models that gave rise to the rule, so that the effect of data-snooping 
is explicitly allowed for. One important conclusion is that even though 
a particular trading rule is capable of producing superior performance 
of almost 10% during the sample period, which is significant at 4% level 
when considered in isolation, the fact that the trading rule is drawn from 
a wide universe of rules means that its effective data-snooping-adjusted 
probability value is 0.9, i.e. totally insignificant. They also find that 
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the best-performing trading rule in sample is totally insignificant out of 
sample even at conventional statistical levels. Clearly chartist rules may 
exist which generate abnormal returns; however the analysis of Sullivan, 
Timmermann and White (1999) demonstrates that great care has to be 
taken in their interpretation. 

Another issue concerning chartists or technical rules is that they can 
be interpreted as special cases of univariate time series of a linear or 
nonlinear form (Time-Series Annex). If there is predictability in past re¬ 
turns it would appear that employment of an explicit univariate model 
would in general dominate. Studies such as Diebold and Nason (1990), 
Lane, Peel and Raeburn (1996) are suggestive that nonlinear univariate 
models for modelling the conditional mean of a series provide improve¬ 
ment over linear models but do not appear to forecast better than linear 
models, with neither of them generating abnormal returns (though see 
Granger and Pesaran, 1999).The results of Hsieh (1991) who found that 
stock returns exhibit nonlinear structure but that this is parsimoniously 
captured by a model linear in mean but nonlinear in variance (Time- 
Series Annex) is probably the model that is currently thought of as 
most applicable to asset prices or returns. Clearly chartist methods may 
conceivably deal more parsimoniously with nonstationarities or regime 
changes than explicit nonlinear models though it is not altogether clear 
why this should be the case. 

Overall it would appear that the market efficiency hypothesis has not 
as yet been invalidated at the weak-form level. 

Semi-Strong Market Efficiency 

Numerous empirical tests of semi-strong efficiency exist. Some of the 
work reporting anomalies would appear to be subject to the data-snoop- 
ing caveat. Others are harder to explain. In the latter context we men¬ 
tion the empirical work that demonstrates that monthly or quarterly 
stock returns are predictable (e.g. Campbell, 1987; Faina and French, 
1989; Pesaran and Timmermann, 1994, 1995, 2000). Consideration of 
the loglinear form of the present value model illustrates that if expec¬ 
tations of future dividend change are not too noisy then stock returns 
may have predictability in that variables that help predict future returns 
may exist. This would not invalidate market efficiency. The question is 
whether the predictability has economic value in the sense of generating 
abnormal returns. From this perspective standard measures of forecast¬ 
ing accuracy have modest information content, since they do not allow 
for transactions costs or map into the nature of the profit decision. For 
example a forecast that the market would rise by 12% when it rises by 
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5% is inferior from the perspective of a squared error measure of forecast 
accuracy to a forecast of 0%. Clearly a decision rule of the buy-sell type 
would value the forecast differently (see e.g. Pesaran, 1992; Granger and 
Pesaran, 1999). It is not clear as yet whether the predictability in asset 
returns generates abnormal returns particularly when the potential for 
changing risk is allowed for. 

Strong-Form Market Efficiency 

Insider trading in the major asset markets is subject to legal restrictions 
in a number of countries and has been illegal for a number of years. For 
instance in the US the Securities and Exchange Act of 1934 prohibits 
agents from trading securities while in possession of material inside infor¬ 
mation. Insiders are defined to include not only corporate insiders but 
also anyone who obtains material, non-public information from a cor¬ 
porate insider, or from the issuer, or who steals such information from 
another source. In subsequent years further acts were passed and in 1988 
insider trading sanctions were further increased so that infringement can 
result in fines of up to one million dollars or five to ten years in jail. In 
the UK insider trading is a criminal offense and can result in up to two 
years in jail. As a consequence of this, indirect methods have to be em¬ 
ployed to ascertain whether insiders can make abnormal returns on the 
basis of their private information. That insider trading does take place 
is prima facie supported by studies such as Keown and Pinkerton) 1981) 
who found that on average 40-50% of the price gain experienced by a 
target firm’s stock occurs before the actual takeover announcement. 

Insiders who obey the law are in a number of countries, such as the 
US and UK, required to notify the stock exchanges of their ‘routine 
trades’ in the shares of their companies. This information is published 
reasonably quickly. A number of researchers have investigated whether 
this legal trading by corporate insiders can predict future stock market 
returns - e.g. Finnerty (1986), Friederich et al. (2000), Gregory et al. 
(1994), Pope et al. (1990), Seyhun (1986, 1992).These tests are semi¬ 
strong tests, since the information set underpinning the tests is public. 
The evidence is suggestive that excess returns can be obtained though 
the measurement of normal returns is an issue in some studies. 

Jeng et al. (1999) examine the returns to corporate insiders them¬ 
selves (based on legal trades) using a comprehensive sample of reported 
insider transactions from 1975 tol996. Their carefully-conducted study 
provides evidence of significant abnormal returns. 

Meulbroek (1992) investigates the returns to illegal trades by insiders. 
Her sample consisted of individuals charged with insider trading during 
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1980-1989. Her data base included information on the charges brought, 
penalties incurred, profits earned, number of securities traded, type and 
source of the inside information. Among the defendants who traded, the 
median defendant transacted in one security and reaped $17,628 in profit 
per security. By analysing security prices on the days insiders traded 
and did not prior to public announcements Meulbroeck could investigate 
the impact of insiders on stock price movements. She concluded that 
insider trading increases stock price accuracy by moving stock prices 
significantly. She found that the abnormal price movement on insider 
trading days is 40-50% of the subsequent price reaction to the public 
announcement of the inside information. It would appear therefore from 
the studies conducted so far that strong-form efficiency is not empirically 
supported. 

The arguments for and against the legality of insider trading are 
discussed in e.g. Ausubel (1990), Benabou and Laroque (1992), Leland 
(1992), Khanna, Slezak, and Bradley (1994), Tighe and Michener (1994), 

We now consider in more detail some tests for semi-strong market 
efficiency. 

Variance Bounds Tests 

An important test of the efficiency of present value models of asset prices 
was originally developed by LeRoy and Porter (1981) and Shiller (1979, 
1981). This exploits the property of rational expectations that the vari¬ 
ance of the outcome is greater than the variance of the rational expec¬ 
tation forecast. 

For a variable Y t 


var(Ft) = var (E t Y t+i ) + var(u) (141) 

where i>t is the forecast error. This follows since rational expectations 
implies that the covariance between the forecast and the forecast error 
is zero. 

In models of asset prices rational expectations implies that the actual 
variance of an asset price, P t , which is a weighted average of future 
expectations of fundamentals should be less than the variance of the 
asset price computed on the basis of the actual stochastic process of the 
fundamentals. 

For simplicity we consider the model in level rather than log level 
form. The asset price has the value, as we have seen in e.g. (33), of: 
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where D t+l are the fundamental returns and 1 + R is the constant dis¬ 
count rate. 

Assuming rational expectations we can decompose the present value 
relationship into two components one of which is the perfect foresight 
path and the other the sequence of rational expectations prediction er¬ 
rors: 


= y _D^_ y 

^ n _i_ m* ^ 




tn 1 + R ) i 

where the Ut+i are the forecast errors. Consequently 

P* =Pt+v t 

where 


D, 


p* =y ~^±l. and vt =y -^±u 

yyi + RY tyi + ny 


From (144) 


var(P t *) = var(P t ) +var(r; t ) 


(143) 

(144) 


(145) 


Hence plainly var P f <var P*. 

The striking finding of the early literature was that stock prices ap¬ 
pear to move too much to be consistent with subsequent changes in 
actual fundamentals. For example Shiller (1981, b) employing annual 
data on price and dividends from 1871 tol979 for the US stock market, 
so that the actual dividend stream is employed up to 1979 plus a proxy 
for 1980 to infinity (typically the actual last value of the asset price) 
to compute the variance in 1871, found that the actual asset price was 
some five times higher than the perfect foresight asset price. Leroy and 
Porter (1981) obtained similar results employing measures of earnings 
as fundamentals. 

Two statistical problems with this analysis were pointed out by Flavin 
(1983), Kleidon (1986) and Marsh and Merton (1986). Computation of 
the sample variances in (144) involves use of the sample mean. In small 
samples Flavin and Kleidon demonstrate that use of the sample rather 
than population mean leads to a bias towards rejecting efficiency. 

Marsh and Merton demonstrate that the variance bounds tests pro¬ 
posed by Shiller are not appropriate if the process generating the funda¬ 
mentals are non stationary. 

Mankiw, Romer and Shapiro (1985) develop a variance bounds test 
which is robust to these two points. Consider a naive forecast of the 
asset price, P” 
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where P" is the naive expectation of future fundamentals (say a constant 
growth of dividends model). 

We can write 


Pt - p t = ( p t - Pt) + (Pt - P?) (147) 

Taking the square of the left- and right-hand sides of (147) and taking 
unconditional expectations of t information 

E{P* - P t n ) 2 = E(P* - P t f + E(P t - P t n ) 2 (148) 

since by rational expectations E(P* — Pt)(Pt — P T) = 0 (he. the rational 
expectations forecast error, the first term in braces, is orthogonal to the 
second term) 

Equation (141) implies that 


E(P t * - P”) 2 > E{P* - P t ) 2 

(149) 

E{P* - P t n ) 2 > E(P t - P”) 2 

(150) 


The first expression states that the perfect foresight path is better 
forecast by the actual asset price than the naive forecast; the second 
expression states that the perfect foresight path fluctuates more around 
the naive path than does the actual price. 

On the basis of tests that exploit these bounds Marsh and Merton 
also reject efficiency though the violation is smaller than in the original 
tests. In defense of the efficiency hypothesis we should note, as pointed 
out by various authors (e.g. LeRoy and LaCivita, 1981), that the models 
assume a constant discount rate or constant realized returns if the model 
is cast in loglinear form (see e.g. Campbell, Lo and Mackinlay, 1997), 
and the assumption that agents are risk neutral. 

Empirical Tests: The Foreign Exchange Market 

The foreign exchange market in particular has been looked at particularly 
exhaustively and is clearly of great interest for macroeconomics. We 
examine some of the empirical issues in detail in this section since many 
of the issues are relevant to other empirical work. A building block 
of many analyses is that covered interest arbitrage holds. Since this is 
an arbitrage condition, violation of it would be highly damaging to the 
efficient markets hypothesis. Numerous empirical studies have examined 
the covered condition. It appears that, once appropriate allowance is 
made for transactions costs and care is taken to ensure that data in 
empirical tests is sampled at the same moment in time, the condition 
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is violated very infrequently (see e.g. Taylor, 1987, 1989). From the 
uncovered and covered arbitrage conditions we obtain that the logarithm 
of the forward exchange rate is equal to the market expectation of the 
logarithm of the future spot rate defined by the maturity date of the 
interest rate. In the case of a one period horizon we obtain 

F* = EtS t+ 1 (151) 

Under the assumption of rational expectations we can rewrite (151) 
as 

S t +i = Ft + u t+i (152) 

where u t +1 is the rational expectations forecast error so that u t +\ = 

S t +i ~ E t S t+1 . 

Since St+i, and F{; are observable, early tests of the rationality of F% 
were conducted by estimating the relationship 

St.+i = cto + oil F$ + e *+1 (153) 

and testing whether c(o= 0 , aq= 1 and the error term exhibited serial 
correlation. 

These tests were supportive in general of the efficiency of the forward 
rate as a predictor. However it was realized subsequently that because 
the spot and forward rate appear to be nonstationary 1 ( 1 ) processes 
(Time-Series Annex), a coefficient of unity obtained in estimates of a\ 
would be a weak test, even if appropriate standard errors were employed, 
since non-rational predictors which shared a common trend would exhibit 
this property. 

Fama (1984) reported regression estimates for numerous currencies 
in the post-war floating period of the form 

St+i — St = 7o + 7i {Ft ~ St) + £t+ i (154) 

Comparison of (152) and (154) shows that if the estimates of 77 and 
7 0 are not significantly different from one and zero respectively then the 
forward rate is an efficient predictor. 

In fact Fama obtained the striking result that the estimates of were 
negative and significantly different from unity (or zero). Qualitatively 
the same results have been obtained for numerous different data sets and 
time periods. 

For example using monthly data from 1978.01 to 1990.07 McCal- 
lum(1994) reports the following results for the Dollar/Pound 

S t+ i = —0.0137 + 0.977F* if = 0.96 


(155) 
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S t+ i - S t = -0.0078 - 4.7403(F? - S t ) R 2 = 0.111 (156) 


Whilst employing weekly data for the Dollar/Pound between Decem- 
berlOth 1921-May 20thl925 we obtain 

S t+4 , = 0.121 + 0.920F t R 2 = 0.86 (157) 


,57+4 - S t = 0.0056 - 3.3166(F t * - S t ) R 2 = 0.043 (158) 

The coefficients on the forward premium are significantly different 
from unity in these regressions and also significantly less than zero. The 
negative coefficient implies that not only is the forward rate an ineffi¬ 
cient predictor, it is also inferior to the spot rate as a predictor of the 
future spot rate. These results appear to imply that the foreign exchange 
market is inefficient. Whilst this may be the case, and directly observed 
survey data of exchange rate expectations support such an interpretation 
since they typically exhibit bias and inefficiency (see e.g. Frankel and 
Froot, 1987, and Froot and Frankel, 1989) a number of possible resolu¬ 
tions of this apparent anomaly have been suggested which are consistent 
with market efficiency. We might also note in passing that if the out¬ 
come was due to inefficient expectations it poses a major problem for 
economic analysis given the systematic nature of the bias of the forward 
premium over 70 or so years. 

The first explanation of the anomalous finding was set out by Fama 
(1984): the existence of a variable risk premium. 

A time-varying risk premium 


If we relax the assumption that investors are risk averse then agents 
will require a risk premium for undertaking the uncovered position. A 
variety of alternative theoretical derivations of the risk premium have 
been proposed (see e.g. Bekaert, 1996; Hansen and Hodrick, (1983), 
Hodrick, 1989; Sibert, 1989). In what follows we use the Consumption 
CAPM model. 

The ex-post nominal return, (1 + i?f), to an uncovered domestic 
transaction, is 


1 + (1±R()£4±1 

St 


(159) 


where R;f is the risk-free nominal foreign interest rate and s t is the ex¬ 
change rate measured as domestic currency per unit of foreign currency. 
Covered interest arbitrage for the risk-free nominal domestic interest 
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rate, (1 + R t ) is given by 


1 + Rt — 


(i + R{)ft 

St 


(160) 


where ft is the forward exchange rate in levels. (Note that (68) is an 
approximation to (160) where log(l + Rf) ~ Rf, and S = Ins) 

Substitution out of the foreign interest rate from the covered condi¬ 
tion and substitution in (159) gives 


i + Rt 


(l + Rt)st +1 

It 


(161) 


The real ex-post domestic return from an uncovered speculation (1 + 
R t ) is given by 


1 -fjs _ (1 + Rt)st+iPt _ (1 + Rt)st+iStPt 
JP^l “ StftPt+l 


(162) 


where P is the price level. The real ex-post domestic return from the 
risk-free nominal bond (1 + Rt) is 


1 + Rt 


(1 + Rt)Pt 
Pt +1 


From the consumption CAPM 


E t f3(l + R°)u'(c t+ 1 ) 
u'(c t ) 


EtP( 1 + Rt)u'{ct+ 1 ) 

it'{c t ) 


(163) 


(164) 


Assuming conditional log normality and the form of utility function 
in (21) we can employ (164) and (162) to obtain 


E t S t +i = Ft~ 0.5var(log ^-) + cov([logs t+ i/s t ] logP t+ i/P t ) + 

St 

^cov([log s t+ i/st] log Ct+i/ct) (165) 

(note again: F = log f,S = log s). 

The risk premium reflects the covariation between changes in the 
logarithm of the exchange rate and the logarithm of changes in con¬ 
sumption. It is interesting to note that if 6 = 0, so that the consumer is 
risk neutral there are still terms driving a wedge between the expected 
future spot rate and the forward rate. These terms occur by virtue of 
what is known as Jensen’s inequality. 

If 


ft — F t St +i 


(166) 
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where we measure say in pounds to dollars we also require measuring in 
dollars to pounds that. 



1 

St+l 


(167) 


However by Jensen’s inequality 


E t 



> 


1 

E t a t+1 


(168) 


We cannot simultaneously have (166) and (167) holding which would 
appear to imply that one party can make expected nominal profits, 
known as Siegel’s (1972) paradox. The resolution is that traders are 
interested in expected real profits. Given the assumption of lognormal 
distributions (165) shows the appropriate condition. The terms resulting 
from Jensen’s inequality are generally presumed to be small but Sibert 
(1989) shows that this assumption may be inappropriate. 

Addition of a risk premium to the uncovered arbitrage condition 
implies, in conjunction with the covered condition which remains un¬ 
changed (it is riskless), that 


Ft — E t S t+1 +p t 


(169) 


where p t is the ‘risk premium’ (inclusive of all the terms). 

Subtracting the spot rate from both sides of (169) we obtain that 


Ft — St — E t St+ 1 — St + Pt. (170) 


Now consider the regressions 

Ft — St +1 = Oil + Pi(Ft — St) + eit+i 


(171) 


and 


St +1 — S t — 0-2 + /?2 (Ft — St) + £2t+l 


(172) 


where the a and (3 are constants and the e are the error terms. 
The least squares estimates of (3 1 and (3 2 are given by 


Pi = 


cov(F t - S t+1 ,F t - S t ) 


var (F t - S t ) 


a 2 (p) + cov(p t ,E t St+i - S t ) 


<y 2 (p) + 2co y(p t , E t St +1 - St) + <j 2 (E t St+i - S t ) 


and 


/?2 = 


cov(<St + i - S t , F t - S t ) 

var (F t - S t ) 


(173) 
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a 2 (E t S t+ 1 - S t ) + cov(p t , EtSt+i - S t ) . 

a 2 (p) + 2cov(p t , EtSt+i - S t ) + a 2 (E t S t+1 - S t ) ( ’ 

(recall in the least squares regression Y = a + (3X , the estimate of 
(3 = c °al-( jq ^ ) an d that under rational expectations cov(e t+ i, any variable 
at t) = 0. Note also that S t +1 — S t = E t St +i — S t + e*+i, where e t +i is 
the rational expectations forecast error. 

We note the regressions are mirror images of each other with the 
property that j3 l + (3 2 = 1, an + «2 = 0 and eu+i + e 2 t+i = 0. 

We note from the numerator of (174) that if the estimate of f3 2 is 
negative this is consistent with a time-varying risk premium if the covari¬ 
ation between expected changes in the spot rate and the risk premium is 
negative and greater than the variance of expected changes in the spot 
rate. 

Because /3 1 is positive this implies that the variance of the risk pre¬ 
mium is greater than the covariation between the expected change in 
the spot rate and the risk premium. Together these two conditions also 
imply that the risk premium must have a variance greater than the vari¬ 
ance of expected changes in the spot rate. Hodrick and Srivatava (1986) 
showed how a negative covariation between expected changes in the spot 
rate and the risk premium could be obtained in a properly formulated 
theoretical model. Consequently the negative coefficient obtained in the 
regressions of changes in spot on the forward premium could be consis¬ 
tent with rational expectations and a time varying risk premium. Un¬ 
fortunately data which would allow direct proxying of the risk premium 
are not available at the frequencies required. Time series methods based 
on modelling the risk premium as a function of the variance of the spot 
rate or forecast errors have not provided much direct support for a vari¬ 
able risk premium, though the models of the risk premia are possibly 
poor approximations - see e.g. Baillie and Bollerslev (1990) and Do- 
mowitz and Hakkio (1985), Bollerslev and Hodrick (1996) Engel (1996). 
In addition the magnitude of the variance of the risk premia implied by 
the estimates of /? 2 are regarded by some as perhaps implausibly high 
(though see Pagan, 1988). For example an estimate of f3 2 °f ~4 implies 

that var (p t ) > 5var (F t -S t ) (note = -4= 1 - 
is a different way of writing the estimate of /3 2 ). 

A second potential explanation of the empirical findings is called the 
‘peso problem’, to which we now turn. 

Peso Problems 

The expression ‘peso problem’ was probably first employed by Milton 
Friedman in his analysis of the behaviour of the Mexican currency, the 
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peso, in the early 1970s. Although the Mexican exchange rate was fixed 
between April 1954 and August 1976 at 0.08 dollars per peso, Mexican 
interest rates were substantially above US interest rates. This finding 
presented an apparent puzzle, since rational expectations of a continuing 
fixed exchange rate regime would, given negligible risk premia, imply via 
uncovered interest arbitrage near equality of Mexican and US interest 
rates of similar type. Friedman rationalized the differential in terms of 
a market expectation of a devaluation of the peso. In fact in August 
1976 these expectations were subsequently realized when the peso was 
allowed to float and fell some 46% in value. The peso problem thus refers 
to a situation where rational agents anticipate the possibility of future 
changes in the data-generating mechanism of economic variables. 

To illustrate the nature of the peso problem suppose there are two 
possible economic regimes in which the variable yt follows the process 

Vt = V\ + Oyt-i + tit in regime 1 (175) 

or 

yt=y 2 + u t in regime 2 (176) 

where y 1 , y 2 and 9 are constants and Ut is a random variable which is 
assumed to be the same, for simplicity, in each regime. 

Consider the rational expectation of yt+i formed on the basis of the 
information set fl available at time t, E(y t+i | H t ). We assume that 
rational agents do not know which regime will prevail in the next period; 
rather they have probabilities qt that regime 1 will occur and 1 — q t that 
regime 2 will occur. 

The rational expectation of yt+i is then given by 

E(yt +1 I fit) = qtiVi + Oyt) + (1 - q t )y 2 (177) 

The peso problem in its starkest form can be illustrated by assum¬ 
ing that only one of the regimes is ever observed in a sample of data. 
Consider the case where only regime 1 occurs. The rational expectations 
forecast error for this case is given by 

Vi + Oyt + ut +1 - E(y t+ 1 | fit) = u t+ 1 + (1 - q t )[y i - y 2 ] + (1 - Qt)0y t 

(178) 

We observe from (178) that the ex-post forecast error in this case will 
exhibit bias and exhibit correlation with variables in the information set 
when expectations are formed (yt)- In fact if 9 = 1, in our example the 
forecast error will be non-stationary, a possibility pointed out by Evans 
and Lewis (1993, 1994). 
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It is also interesting to note what happens when a standard orthogo¬ 
nality or efficiency test for rational expections is run on data generated 
by this outcome, assuming q t is constant: let the researcher estimate the 
relationship 


Vt +1 - E(y t+ 1 | fl t ) = a + f3(E(y t+ i \ fl t ) - y t ) + e t +i (179) 


where a and f3 are constants and et+i is the error term. 

The least squares estimate of (3 =c:ov(Y, X) / vax(X) where Y is the 
dependent variable and X the independent) is given by 


P 


-0(1 ~g) 
(1 - 9q) 


(180) 


Consequently for 8 > 0 the regression estimate of P, which is nega¬ 
tive, would imply that the rational expectation of yt+i was an inferior 
predictor to the current level of y t . 

Non-occurrence of a regime (event) in a particular sample provides 
the extreme example of the peso problem. Such possiblities could occur 
when for instance there is a small probability of a change in regime. 
Such cases would not be amenable to differentiating empirically between 
rational expectations and an alternative unless the probability q and the 
alternative regimes could be described. 

Evans (1996) has set out a general method of analysis of the peso 
problem some of which we now set out. We can define the rational 
expectations forecast error, e*+i, as 


e*+i = yt+i — E(y t+ 1 | fl t ) (181) 

To examine how the properties of the forecast error are affected by 
the presence of discrete changes in regime Evans supposes that y t +1 
can switch between two processes which are indicated by changes in a 
discrete-valued variable, Z t = {0, 1}, so that Z t only takes the value 
zero or unity. Let y t +i{z) denote realized returns in regime Z t +1 = 2 . 
The peso problem is to consider the behaviour of forecast errors, yt+i — 
E(y t + 1 | fit)- Evans does this by decomposing realized returns into the 
conditionally expected return in regime z, which is denoted E(y t +i(z) \ 
fit) and a residual u t +i (which for simplicity is assumed to be the same 
in both regimes). 

The decomposition has the following form: 


yt+i — E(yt+ 1(0) | fit) + AE(y t +i \ flt)Z t +i + ut +1 (182) 

where 


A E(y t+1 | fit) = E(y t+ i( 1) | fit) ~ E(y t+ 1 ( 0 ) | fl t ) 


(183) 
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Substitution of (183) into (182) and setting Z t+ i as either one or zero 
informs us, as pointed out by Evans, that it will always be possible to 
decompose y t+ 1 in this way regardless of the process yt+i follows in either 
regime or the precise specification of the information set fl t . Given the 
assumption of rational expectations E(u t +i \ fit.) = 0, so that the error 
term u t + 1 has the conventional rational expectations properties, Evans 
defines this error as the within-regime forecast error, since it represents 
the error when the t + 1 regime is known. 

When agents are unaware of the regime in t + 1 their forecast errors 
will differ from the within-regime errors. Taking expectations of both 
sides of (182) on the basis of f l t information, we have from the properties 
of rational expectations that 

E(y t+ i | fit) = E(y t+1 (0) | fl t ) + A E(y t+1 \ fl t )E(Z t+1 \ fl t ) (184) 

where we note that the right-hand side now contains the expected regime 
in t + 1. 

Subtraction of (184) from (182) and substitution in (181) gives the 
ex-post forecast error as 

et+i = Ut- i-i + AE(y t+ i | fl t )[Z t+ \ — E(Z t+ \ | f2 t )] (185) 

We observe from (185) that when the regime at t + 1 is known, 
so that Z t+ 1 — E(Z t + 1 | Q t ) = 0, the forecast error is the standard 
within-regime error so that there is no peso problem. When the future 
regime is unknown the second term in (185) adds a component to the 
within-regime error (given that the within-regime forecasts differ so that 
AE(y t+1 | fl t ) ? 0). 

Evans illustrates this point more clearly by supposing that in period 
t + 1 regime 1 occurs. From (185) with Z t+ i = 1 we obtain that 

e*+i(l) = Ut+\ + AE(y t +i | O t )[l — E(Z t +i \ O t )] (186) 
or 

et+i(l) = Ut+ i + AE(y t+ i | fit.) Pi{(Z t+ i = 0 | H t )} (187) 

where Pr denotes probability. 

When the probability of regime 0 is non-zero the rational expectation 
error contains an additional component which is the difference between 
the within-regime forecasts multiplied by the probability that regime 
0 occurs. As we illustrated with our example above when the within- 
regime forecasts differ this can generate an ex-post rational errors which 
may have a non-zero mean, so that they appear to be biased or serially 
correlated, and so inefficient. It is also apparent from (185) that the 
extent to which this issue will be important in empirical evaluations of 
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forecasts will be dependent upon the frequency of regime shifts in the 
data sample. In the extreme case when only one regime occurs the errors 
will match those in (186). When there are a number of regime changes 
the forecast error will be a weighted average of e t+ i(l) and e t+ i(0) (de¬ 
fined anologously to (186)).The effect in this case on the properties of 
the forecast error will depend on the sample properties of the forecast 
error for regimes Z t+ \ — E(Z t+ 1 | fi t ). When the number of regime 
changes in the sample is representative of the underlying distribution 
of regime changes, from which the rational expectations of agents are 
generated, then the forecast error for regimes will exhibit a zero mean 
and the forecast errors will exhibit the standard rational expectations 
errors. 

An interesting extension of the peso issue considered by Evans and 
referred to by Kaminsky (1993) as the generalised peso problem is when 
agents do not know which current regime they are in (for instance 
the central bank’s preferences). Evans illustrates some implications 
by assuming that the degree of uncertainty of the regime is given by 
Pr (Z t | fit). When the regime is known and there is no uncertainty 
Z t = z and Pr (Z t | fi t ) = 1. When Pr (Z t | fi t ) ^ 1 Evans discusses some 
implications in the following manner. First we employ the identity 

Pr(Zt+i — 0) = Pr(Z 4+ i = 0 | Zt = 1) — Pr(Z t+ i = 0 \ Z t = 1) 

+ Pr(Z t+ i=0) (188) 

where for simplicity we have dropped the notation for the information 
set fit- (We read e.g. the second term of (188), Pr(Z t+ i = 0 | Z t = 1), as 
follows: the probability of Z t+ 1 = 0 given Z t = \. all conditional on the 
information set fit). 

We substitute (188) into (187) to obtain 

e*+i(l) = Ut +i + AE(y t+ i) Pr(Zt+i = 0 | Zt = 1) — 

Pr (Z t+1 = 0 | Z t = 1) + Pr(Z t+ i = 0)] (189) 

or 

et+i(l) = Ut+i + AE((/t+i){Pr(Zt+i = 0 | Zt = 1} — 

AE(y t+1 )[Pv(Z t+1 = 0 \ Z t = 1) - Pr (Z t+1 = 0)] (190) 

Now 

pr(Z t+1 = 0 | Zt = 1) - Pr(Z t+ i = 0)] (191) 

the last term in (190), is equal to 

[Pr(Z t+ i = 0 | Z t = 1) - Pr(Z t+ i = 0 | Z t = 0)] Pr(Z* = 0) (192) 
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since equating (191) and (192) and rearranging gives the definitional 
statement 


Pt'iZt+i — 0) — Pr(Z t+ i — 0 | Zt — 1)[1 — Pr(Z t — 0)] + 

Pr (Z t+1 = 0 | Z t = 0)[Pr (Z t = 0)] (193) 

(note 1 — Pr(Z t = 0) = Pr (Z t = 1)). 

Consequently the last term in (190) can be rewritten as 

-AE(y t+1 )[Pv(Z t+1 = 0 | Z t = 1) - Pr (Z t+1 = 0 | Z t = 0)] Pr (Z t = 0) 

(194) 

Recognizing that the first two terms on the right-hand side of (190) 
are the same as those in (187) we see that uncertainty about the current 
regime manifests itself in an additional term in the forecast error given 
by (194). We observe that this term will only be zero if the probability 
of regime 0 at time t + 1 is independent of the current regime or of 
course that within-regime forecasts are equal. In addition we observe 
that changes in Pr (Z t = 0) (perhaps due to learning about the regime) 
will contribute to the structure of the error term. 

Evans (1996) documents a number of other interesting implications 
of the peso problem. In the context of asset prices, where current values 
depend on expected values of fundamentals into the indefinite future, 
he demonstrates how news about future fundamentals can influence cur¬ 
rent prices through the normal channel of changed forecasts of future 
fundamentals as well as the additional channels of the difference in fun¬ 
damentals in the different regimes as well as the dynamics of regime 
switching. In the context of empirical work he demonstrates how peso 
problems can contribute to the explanation of some of the puzzles found 
in asset markets, such as the bias of the forward premium. Bekaert, 
Hodrick and Marshall (2001) show how peso problems can contribute to 
an explanation of anomalies in the term structure of interest rate (see 
below). 

The peso problem is that sample statistics are not representative of 
the population so that statistical inferences are potentially misleading. 
It is clearly of considerable importance in evaluating models of expecta¬ 
tions. 

Statistical Rationale 

An alternative statistical rationale for the forward premium bias is sug¬ 
gested by Baillie and Bollerslev (1997). An empirical regularity in the 
exchange market is that the statistical properties of changes in the spot 
rate and the forward premium are markedly different. In monthly data 
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the variance of changes in the spot rate is typically some 100 or so times 
greater than the variance of the forward premium. In addition whilst 
changes in the spot rate typically exhibit little evidence of serial corre¬ 
lation and can often be well approximated by martingales the forward 
premium typically displays very persistent slowly decaying autocorrela¬ 
tions which may be described as a fractional process - see e.g. Baillie and 
Bollerslev (1994), Byers and Peel (1996), also the Time-Series Annex). 
The statistical properties of changes in the spot rate and the forward 
premium imply that estimates of (3 2 in small samples may be fragile. 
Baillie and Bollerslev show this is the case. They report estimates of j3 2 
in the Fama regression obtained from a rolling sample of data. Employ¬ 
ing monthly data for the DM/dollar they consider 208 five-year rolling 
regression estimates for (3 2 obtained by beginning in March 1973 and 
using a total of sixty observations through to February 1978. Then the 
next estimate was obtained by using data from April 1973 through to 
March 1978, until the final estimate was based on data from December 
1990 through to November 1995. The estimates of (3 2 display marked 
instability varying between —13 and 3.52. These findings suggest that 
for many sample sizes encountered in practice, the estimate of /3 2 is likely 
to be uninformative about the true value of (3 2 . 

They provide further support for this view by simulating data from 
a known structure which is calibrated to produce the stylized statistical 
properties exhibited by the spot and forward premiums. The expecta¬ 
tion embodied in the forward premium is rational by construction. Their 
simulated models are found to generate results which are similar to those 
reported in the literature. In particular, the forward premium exhibits 
persistent autocorrelation and the estimates of /3 2 are widely dispersed 
between negative and positive numbers.These empirical findings are of 
some interest since the statistical properties exhibited by the dependent 
and independent variables in the Fama regression are qualitatively sim¬ 
ilar to those exhibited in other tests of market efficiency such as bond 
rates (see below). 

McCallum (1994) illustrates another important point that needs to be 
given consideration when testing market efficiency relationships. This is 
whether the relationship is invariant to government policy. To illustrate, 
McCallum assumes that the monetary authorities manage interest rates 
so as to smooth their movements while also resisting changes in exchange 
rates. 

Defining x t = Rt — Rf he considers the policy rule 

Xt = A (St — St- 1 ) + axt-i + e t (195) 

where A, a are constants and et is a random error. 
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Uncovered arbitrage is given by 

Xt = EfSt.+i — St + Vt (196) 

McCallum interprets Vt as measurement error though it could be 
interpreted as a risk premium. He assumes that 


vt = pv t -i+u t (197) 

where p is a constant and Ut is a random error. Assuming a solution for 
changes in the spot rate of the form 

S t — St -1 = Ax t _ i + But + Cet (198) 


and equating coefficients we obtain the solution as 

Ut 


q q _ iP _ I 

X>t ~ 1 — -^- Xt-i + 


(A + <T — p) 


e* 

A 


(199) 


(note that x t -i = Rt-i — R{-i = Ft-i — St- 1 ). The solution for x t can 
also be obtained as 


x t = pxt-i + 


Xut 

(A + a- p) 


( 200 ) 


The important point to note is that the coefficient on Xt-i (= F t -1 — 
St -1 via the covered arbitrage condition) in (199) contains the policy 
parameters A and a. If er is close to unity and greater than p and A is 
positive, then the coefficient can be negative. In addition, the forward 
premium will exhibit persistent serial correlation for p close to unity and 
if the variance of u t is very small relative to the variance of e t then 
changes in the spot rate will be closely approximated by a white noise 
error process even though the true process is a persistent ARMA(1, 1) 
process. 


This concludes our discussion of tests of market efficiency in the ex¬ 
change market. It has emphasised the considerable difficulties in deter¬ 
mining from ex-post regression whether this market is efficient. 

We next consider, more briefly, the implications of market efficiency 
for the behaviour of interest rates. 


LONG-RUN INTEREST RATES UNDER MARKET 
EFFICIENCY 


There are a great variety of bonds. Bonds are issued by private compa¬ 
nies, local authorities and by the government. Naturally the probability 
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of default can vary across bonds with different issuers. But government 
bonds are normally regarded by investors as free of default risk and it 
is the pricing of these types of bonds in an efficient market that we 
will consider here. Government bonds can be broadly categorized as 
being of two main types. The first type are bonds which pay a regular 
coupon, usually semi-annual, which is a fixed fraction of the face value 
of the bond up to the maturity date when the face value is also repaid. 
The other type are zero-coupon bonds, also called discount bonds which 
make one payment at the maturity date of the bond. Bonds are offered 
with a great variety of maturity dates, from short up to undated (con¬ 
sols in the UK). Bonds paying coupons can be interpreted as packages 
of zero-coupon bonds with one corresponding to each coupon payment 
and one corresponding to the repayment of the principal and coupon at 
the maturity date. For an individual trader, long-run bonds are substi¬ 
tutable for short-run bonds, since it is possible for him to hold a series 
of short bonds rather than a long bond over the same holding period, or 
conversely to hold a long bond for a short period and then sell it rather 
than hold a short bond to maturity. 

More formally, assume initially that traders are completely certain of 
the future. Initially we consider pure discount bonds so that the return is 
simply the discount from par at which the bond is sold at the beginning 
of the period relative to the redeemed par price at the end of the period. 

The price, P„ t , at time t of a bond which makes a payment of £1 at 
time t + n is given by 


Pnt 


1 

(1 + Rnt) n 


( 201 ) 


where R nt is the bond’s yield to maturity. 

In a world of certainty the pure expectations hypothesis assumes that 
the the following condition must hold: 


(1 + R n t) n — (1 + -Rli)(l + -Rli+l)(l + Rlt+ 2 ) ■ ■ ■ (1 + Rlt+n-l) (202) 


The left hand side of (202) is simply the rate of return on holding an 
n-period bond until maturity. The right side of (202) is the rate of return 
implied by holding a one-period bond for one period, then reinvesting 
the proceeds (principal plus interest) in a one-period bond for the next 
period and so on. 

By taking logarithms and recalling that for small values of the frac¬ 
tion Z , ln(l + Z) can be approximated by Z , we can rewrite (202) as: 

Rnt = -(Rlt + Rlt+l + Rlt+2 + . • • + -Rlt+n-l) 


(203) 
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In other words (203) informs us that long-run interest rates are simply 
averages of future interest rates over the time period to maturity. It tells 
us, for instance, that if short-run interest rates remain constant for the 
indefinite future, then long rates will be equal to short rates. Conversely, 
if future short rates are expected to fall, the current long rate will be 
below the short rate. The relationship is known as the expectations 
theory of the term structure of interest rates. 

When we relax the assumption of perfect knowledge of the future 
and recognize that traders can observe the current long rate R n t and the 
current one-period short rate Ru, it follows if traders are risk neutral 
that: 

Rnt = —[Rit + EtRit+i + E t Ru+2 + ■ • • E t Rit+ n -i] (204) 
n 

If traders are risk averse, then we must add a risk premium to the 
right-hand side of (204) but this does not affect the argument, provided 
it is constant. 

If we consider a bond with a two-period maturity date then 


R2t = 2^ 14 ^t Plt+l] 

(205) 

We can rewrite (205) as 


Et.Rlt+l — Rit = 2(i?2i — Rit ) 

(206) 


or assuming rational expectations that 

Rit+i — Rit = 2(i?2t — Rit) + Vt+i (207) 


where ty+iis the expectations forecast error. Accordingly we observe 
that the expected change in yield on a one-period bond is related to 
twice the difference between the yield on a two-period bond and the 
yield on a one-period bond and that the actual change differs from this 
term by an expectational error. The expectations hypothesis implies 
testable implications such as this for bonds of different maturities. We 
now examine some of these implications further. 

We first introduce the concept of the holding-period return, H nt + 1 , 
which for simplicity we assume is one-period. This is the return from 
purchasing an n-period bond at time t and selling it at time t + 1. At 
time t + 1 the bond will become an n — 1 period bond and will be sold 
for a price P„_i t+ i . 

From (201) 


_ -Pn-lt+l _ (1 + R,it) n 

Pnt (1 + Pn-lt+l)™ _1 


(208) 
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The expectations hypothesis also implies that 

Rit = Et[H n t+ 1 ] (209) 

Equation (209) tells us that the yield on a one-period bond should 
equal, ignoring any risk premium, the expected holding-period return 
on an n-period bond held for one period. Taking logs of (208) and 
approximating as above, and then taking expectations, we can use (209) 
to eliminate the expected holding-period yield. Manipulation of the 
resultant gives: 

EtiRn-u+1 - Rut) = {Rnt ~ f (210) 

n — 1 

Equation (210) informs us that when the spread (the difference in 
yield between the long-maturity and short-maturity bond) is positive, 
changes in yields on long-maturity bonds are expected to increase. Camp¬ 
bell, Lo and Mackinlay (1997) and others have tested (210) by estimating 
the equation 

Rn-lt+l ~ Rnt = a + (3- -—-h £( (211) 

n — 1 

where e* is the forecast error. 

The estimate of /3 should not differ from unity when equation (211) 
is estimated for discount bonds of different maturity dates. The results 
are not supportive of the expectations hypothesis. Estimates of (3 are 
often negative, particularly for bonds of long maturity dates. 

A variety of explanations exist for this anomalous finding and they 
mimic the explanations for the anomolous empirical finding in the ex¬ 
change market discussed above whereby the forward premium is nega¬ 
tively related to future changes in the spot rates. These are time-varying 
risk premia, peso problems, goverment policy rules and low statistical 
power of the regressions. 

We can also write (204) in the form 

Rnt — Rit — — [Ru + (EtRit+1 — R\t + Rit) + {Et.Ru+2 — Ru + Ru) + 

n 

■ ■ ■ + (EtRit+n-i — Rit + -Rit)] — Rit (212) 
or 

Rnt - Rit =E t jr ARlt+i ~ 1 (213) 

z —' n 

2—1 

Equation (213) informs us that the spread is a predictor of future 
changes in short run interest rates. This implication can also be empir¬ 
ically tested. One method is to replace the expected terms on the right 
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hand side of (213) by their actual values and regress the resultant on the 
spread, employing an estimator that corrects for the overlapping expec- 
tational errors. The coefficient should not differ from unity. Campbell, 
Lo and Mackinlay (1997) report such estimates. Their results suggest 
a U-shaped pattern in the estimates for different maturities. For short 
maturities, the spread has a positive coefficient which is less than unity 
and declines initially, becoming insignificant with the maturity horizon. 
At longer horizons a significant positive coefficient is obtained sometimes 
greater than unity. It thus appears that the spread has ability to predict 
short-run interest rates changes for both relatively short and long hori¬ 
zons but not intermediate horizons. The empirical results can again be 
explained by reference to the arguments explaining previous anomalous 
findings above. 

We now turn to analysis of coupon-paying bonds. 

The yield to maturity, or long-run interest rate Y t n , on an n-period 
bond is determined by the fact that the price P™ of the bond is the 
present value of a coupon (C), assumed paid at the end of each period, 
and repayment of the principal (normalized here to unity) at the terminal 
date discounted by Y t n . We have that 


pn _ 

r t — 


C 


C 


C 


1 + Y t n (1 + Y t n ) 2 (1 + Y t n ) 3 


+ ...+ 


C 


(i + Y t n y 

i 

+ 


(i +y t n y 

Letting u = 1+ ^„ we can multiply (214) by u to obtain 

uP t n = Cu 2 + Cu 3 + Cu 4 + ... + Cu n+1 + u n+1 
Subtracting (215) from (214) we obtain 

P t "(l - u ) = Cu - Cu n+1 +u n - u n+1 


(214) 


(215) 

(216) 


Dividing (216) by 1— u, noting that P”(l— u) = Cu(l—u n )+u n {l—u), 
and ~t~~ = w, rearranging after substitution back for u we obtain that 

1 U Y t 


pr = 


c 


+ 


y t r 


- c 


Y t n ' *?(1 + W 


(217) 


If Y t n = C, P" = 1, and conversely. Bonds with this characteristic, 
whose price today equals the principal paid at maturity are selling ‘at 
par’. 

We also notice from (217) that for an undated security or perpetuity, 
n — oo and so P t = so that the price is the coupon divided by the 
very long-term interest rate, Y t . 
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The one-period holding yield, iJ”, on a n-period coupon-paying bond 
consists of both the yield in the holding period plus the capital gain or 
loss. This gives us 


H? = 


t +1 


C 


(218) 


recalling again that an n-period bond at time t becomes an n — 1 bond 
at time t + 1. 

We note from (218) that for a perpetuity, n = oo, 


H t =Y t - ^ Yt) (219) 

Y t +1 

Equating the expected holding-period yield from the perpetuity to 
the yield on a short-term coupon-paying bond we have that 

Y i =Yt _ (Y t+ i -Y t ) ( 2 20) 

Y t+ i 

If we take a first-order Taylor expansion of the last term on the right- 
hand side of (220) around Y we obtain 


Yt 1 = Yt-1 + 1 + n 

Y y 


Rearranging (221) we obtain in expectational form 


EtYt+j. -Y t = Y(Y t - Y) 1 ) 


( 221 ) 


( 222 ) 


We observe that expected changes in the yield on the perpetuity are 
positively related to the spread between the yield on the perpetuity and 
the yield on the short maturity bond. 

Equation (222) can also be solved forward to obtain the solution for 
Y t as 


Y t = 


Y 

1 + Y 


OO 




Y 1 

1 t+i 

(i +Yy 


(223) 


so that the perpetuity is a weighted average of expected future short 
yields. Clearly (223) is a linear approximation to a nonlinear structural 
equation and its ability to approximate will naturally be dependent on 
interest rates not being too volatile so that the approximation remains 
reasonably accurate 

In the more general case of coupon-paying bonds with an n-period 
horizon the derivation can proceed in a similar manner. We substitute 
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for the price of the bond from (217) into the holding period yield (218). 
This gives us the rather messy relation 


H? 


C + 


+ 


Y" 


1 (i+^i )" 


( 


_c 

Y t " 



- 1 


(224) 


(In Shiller, 1979, equation (224) is linearized around Y t n = Y/fi 1 = Y = 
C .) This gives the relationship 


Ht = 


Y n — v Y n ~ l 

1 t ln 1 t+l 

1 ~7„ 


(225) 


where 7 „ = {1 + Y[ 1 - 1/(1 + y)"- 1 ]- 1 }- 1 

Equating (225) to a short yield, l/ 1 (plus any constant risk premium) 
we can solve the resulting difference equation for Y t n . given the terminal 
condition that the price of the bond is 1 at t + n, as 


! 7' = r/ E 'ET' 5 'i.u. 

' i =0 


(226) 


where (f> n is any constant risk or liquidity premium. Equation (226) 
informs us that the n-period yield on a coupon-paying bond is a weighted 
average of the expected yields on the one-period bond. 

Shiller exploits these properties to evaluate empirically whether long- 
run bond yields are too volatile to be consistent with rational expecta¬ 
tions and the observed volatility in short rates (a test analogous to the 
variance bounds test for stock prices outlined above). His empirical find¬ 
ings are inconsistent with rational expectations. Again the results could 
be reconciled by appeal to the sort of issues raised above. 


LEARNING AS AN ALTERNATIVE TO RATIONAL 
EXPECTATIONS: 

The tests we have been examining all assume rational expectations. One 
possible cause of failure could therefore be that expectations are not ra¬ 
tional but rather the result of a learning procedure — of which adap¬ 
tive expectations is an approximation, as shown by Benjamin Friedman 
(1979). 

We observed in the appendix to chapter 2 that if a series followed an 
ARIMA(0,1,1) process then an adaptive expectations scheme could in 
fact be a representation of the rational expectation. Earlier in this chap¬ 
ter we also found that a regressive expectations scheme could be rational 
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in the Dornbusch overshooting model. In general of course these meth¬ 
ods of expectations formation will not have the rational expectations 
property. They illustrate however that in any particular model struc¬ 
ture, since the rational expectation solution can be written in terms of 
observable variables, there is a ‘mechanistic’ method of expectation for¬ 
mulation which will have the properties of rational expectations for the 
particular model structure. In this book we are primarily concerned with 
investigating the properties of models when it is assumed that agents’ 
expectations, or the aggregate of agents’ expectations, have the rational 
expectations property. Maintained assumptions of this approach are that 
agents know the true model, or act as though they did, and also assume 
that other agents also possess this information (see Townsend, 1978). In 
this approach econometricians will come across cases of ‘adaptive’ expec¬ 
tations which by chance represent the rational forecasting mechanism. 

However there is an important literature that relaxes these assump¬ 
tions and assumes that expectations follow a learning rule,which has 
the potential to converge to rational expectations (see e.g. Evans and 
Honkapolija, 1999, 2001, and Sargent, 1993). From this perspective the 
adaptive or regressive expectations schemes can be interpreted as simple 
learning rules which have converged on the rational expectations solu¬ 
tion. A variety of learning rules, in general possibly more intelligent, 
have been studied. One such example is least-squares learning whereby 
agents employ least-squares regression to estimate the parameters of a 
model and employ the resulting model to forecast the variable of interest. 
This approach to learning models agents in the same way as economists 
who employ econometrics and statistical inference in deciding between 
competing models. Employing this approach also highlights another as¬ 
pect of rational expectations that agents in the model are assumed to 
possess more information than the outside observer. 

One motivation for studying learning rules is to ascertain whether 
they converge on rational expectations and how fast this process is. 
Answers to such questions may be relevant for some in deciding how 
plausible the rational expectations assumption is. 

Whilst the implausibility of the assumption that agents know the 
true model may be sufficient justification for some for studying the im¬ 
plications of the learning assumption there are other properties of the 
rational expectations assumption that have been used to justify such an 
approach. 

The first of these is the issue of ‘non-uniqueness’ (as set out in chapter 
2). When the model exhibits more than one stationary solution methods 
that are not part of the formal model structure have to be employed to 
determine which solution is chosen by agents. The solution chosen by 
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different learning rules has been widely studied. However, as we argued 
in chapter 2, ‘non-uniqueness’ should really be seen as the failure of a 
stability condition for the ‘forward’ root. The ultimate remedy is to find 
a specification which satisfies stability conditions that presumably hold 
in the real world. 

The second justification is when a model has more than one equilib¬ 
rium solution. For example, take the widely used model of hyperinflation 
which has been extensively studied in this context. If we assume a gov¬ 
ernment prints money to finance a constant budget deficit, then 

PtG t = Mt — Mt~ i (227) 


where P t is the price level, G t = G is the constant real deficit, and M t 
is the money stock. 

Assume a demand function for money of the form 

^ = f(E tPt+ 1) (228) 

where E t pt +i = E t (\og{Pt+i / Pt) is the expected rate of inflation: 


dM l 
u P t 

dE t p t+ 1 


(229) 


and real output has been assumed constant. 

Assuming money demand is equal to money supplied we can substi¬ 
tute (2) into (1) to obtain 


G = f(E tPt+ 1 ) - f(E t _ lPt )e-r * (230) 


(since logpp^ = Pt.-p~ = e Pt ) 

Equation (3) has two equilibrium solutions if G is not too large (where 
EtPt+i = E t _ip t = Pt = p)- Intuitively if inflation is zero in equilibrium 
the authorities generate no receipts, whilst if inflation is infinite agents 
hold no money so receipts are also zero. Consequently the equilibrium 
surface G = g(p) exhibits two equilibria for G < G max . Assuming ra¬ 
tional expectations the equilibrium exhibiting higher inflation is mathe¬ 
matically stable and the lower one unstable. These rankings are reversed 
assuming adaptive expectations. If the view is taken that mathematical 
stability is not the appropriate selection criteria in a rational expecta¬ 
tions model then there is no mechanism, which is part of the formal 
model structure, to choose between the two equilibria. More generally a 
rational expectations models could exhibit multiple stable equilibria. 

A third justification is structural change. Say a new government or 
a new central banker appears, it is natural to model agents as learning 
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about the new regime. How ‘wet’ or ‘dry’ the new governor is and how 
the probabilities change as new information accrues seem best analyzed 
in a learning framework. 

We will now briefly illustrate some examples of learning mechanisms. 
Evans and Honkapohja (1999) have divided the approaches into three 
groups and we follow their taxonomy. 

Eductive approaches 

In this literature researchers investigate whether the coordination of ex¬ 
pectations on an rational expectations equilibrium can be attained by 
a mental process of reasoning. We illustrate with an example based on 
DeCanio(1979). 

Suppose the demand and supply in a market are given by 


q t = a - bp t + w t (231) 

<7t = c + dE t _ip t + Vt (232) 

where pt is the price level, w t and ty are random disturbances and a, b, c 
and d are constants. 

The reduced form for prices in this system assuming demand is equal 
to supply is given by 

Pt = ^ - jEt-m + =A- BE t -ipt + u t (233) 

where the definitions of A,B,u t are obvious. 

Suppose agents form their expectations initially in an arbitrary man¬ 
ner.The question is whether they can modify their behaviou r in such 
a way as to lead them closer to rational expectations, given by 
Suppose the initial, arbitrary expectation of all agents is given by 

E t-iPt = Pt -i (234) 

From (232) given this expectation the actual evolution of prices will 
be given by 

Pt= A — Bpt-i + u t (235) 

DeCanio assumes that after some passage of time agents realize (rea¬ 
son or deduce) that prices are evolving according to (234) and form the 
new expectation 


E t-iPt = A- Bp t -1 


(236) 
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However this new expectation changes the evolution of the system to 

p t = A — B{A — Bp t - 1 ) + tit — A — BA + B 2 p t -i + u t (237) 

Agents observing the new evolution of prices in the market agents 
revise expectations to 

Et_iPt = A — BA + B 2 pt -1 (238) 

so that actual prices evolve as 

Pt — A — B(A — BA + B~pt~ i) + Ut — A — BA + B~ A — B 3 pt—i + Ut 

(239) 

Continuing in this manner after n iterations we will have 

E”_iPt = A - BA + B 2 A - B 3 A + - + AB n + B n p t (240) 

If \B\ < 1, for large n expectations will converge to the rational expec¬ 
tation 

K-iPi = y^b (241) 

since jAg =1 — B + B 2 - B 3 -\ -h for \B\ < 1. 

Clearly convergence to the rational expectation is not guaranteed 
even in the simple example if \B\ > 1. When iterative expectations con¬ 
verge on the rational expectations solution the rational expectation is 
said to be iteratively E-stable. The iterative expectations of agents were 
assumed to be homogenous in the above example. When convergence 
occurs and the iterative expectations of agents are heterogeneous as in 
Guesnerie (1992) the rational expectations model is said to be strongly 
rational. Evans (1985, 1986) employ the iterative expectations method 
in models embodying multiple solutions, Peel and Chappell (1986) in a 
model embodying multiple equilibria and Bullard and Mitra (2000) in a 
model where agents learn about monetary policy rules. 

Adaptive Approaches 

Early on Benjamin Friedman (1979) argued that agents would learn 
from data via regression about the model and the policy regime. This, 
he pointed out, would produce expectations formation very similar to 
adaptive expectations without necessarily ever leading to rational ex¬ 
pectations. Such statistical learning was subsequently examined to see 
whether it would converge on rational expectations. We consider the 
least-squares learning mechanism initially analyzed by Bray and Savin 
(1986) and Fourgeaud, Gourieroux and Pradel (1986), though we note 
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that more complicated estimation procedures such as neural nets and 
genetic algorithms have been employed (see Sargent, 1993). 

Suppose for simplicity that the reduced form for prices follows the 
process 


Pt — A + BE t -iPt + Czt—i + Ut (242) 

From (241) the rational expectation is given by E t -iPt = 
so that prices evolve as 

Pt — A + - —-Zt-i + u t = A + Gzt-i + u t (243) 

1 — JD 

Suppose agents believe that prices follow the process given by (242) 
but are unaware of the values of the parameters A and G. In the least- 
squares approach to learning agents are assumed to run least-squares 
regressions of p t on z t - 1 and an intercept using previous data on the 
variables. Expectations are then generated from the estimated model. 
As more data becomes available the model is then reestimated, expec¬ 
tations formed and so on. Researchers have demonstrated that the con¬ 
ditions for convergence of recursive least-squares expectations can be 
weaker than those under iterative expectations. (B < 1 in this model as 
opposed to \B\ < 1 with iterative expectations.) 

In the case of least-squares learning where agents perceive the reduced 
form as 

y t = Pxt + et (244) 


where p is a vector of coefficients, x t a vector of explanatory variables 
and et an error, the least-squares estimated coefficients are given by 

(3 t = XiX^j Xiy^j (245) 

It can be demonstrated that the recursive least-squares estimates are 


generated as 




Pt = P t _ x +l t R t 1 x t -i{y t -i - Pt-iXt-i) 

(246) 

and 




R t = -Rt—i + 'y t (x t -ix t _ 1 - R t - 1 ) 

(247) 


with 7 1 = j. and where Rt is an estimate of the moment matrix for ay. 

t-i ; 

For suitable initial conditions Rt = x i x i- 
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We note that the term = | is known as the ‘gain’. It is important 
in determining the speed of convergence to the true parameter. Some 
intuition of the least-squares updating formulae can be obtained by con¬ 
sidering the recursive least-squares estimate of the mean Ez t = /i. The 

t 

least-squares estimate is the sample mean z t = \ Y z n■ 

n= 1 

Subtracting the sample mean at t — 1 from both sides of Zt and 
rearranging gives 

z t = Z t -1 + j(zt -Z t -i) (248) 

£ t- 1 £-1 

since tz t = ^2 z n — zt+ z n an d (£ — l)^£-i = ^ 4 so that t(z t — 

n= 1 n= 1 n= 1 

£-1 

Zt- 1) = Z t + Yj z n- tZt-l 
n= 1 

= Zt + (t- l)z t -l - tz t - 1 = Zt- Zt- 1. 

Adaptive methods of learning have the same general type of structure 
which is given by 

St = Ot-i + XtQ(t,dt-i,X t ) (249) 

where 9 t is a vector of parameters, A t is the gain parameter equal to j- in 
the case of least-squares, Q is a function and X t is the vector of variables 
in the structural model. We note that adaptive expectations is a special 
case of (248) where the gain parameter is constant.. 

The evolution of X t will depend on 9 t - 1, in the case of a linear system 

X t = A(6t-i)X t -1 + B(9 t -i)W t (250) 

where Wt is a vector of disturbance terms. 

Marcet and Sargent (1989a,b), Evans and Honkapohja (1998) derive 
stability results for linear (and nonlinear) systems. Sargent (1999) as¬ 
sumes the US authorities used constant-gain least-squares learning about 
the Phillips Curve and maximised a social objective function to pick in¬ 
flation; he argues this fits US post-war data, accounting for the ‘great 
inflation’ where rational expectations cannot. 

Rational Learning 

Rational learning has to be interpreted from a perspective that acknowl¬ 
edges the benefits and costs of more accurate forecasts for an agent as 
in Feige and Pierce (1976) or Evans and Ramsey (1992), so that ratio¬ 
nal expectations may not be attained unless calculation costs are zero. 
However the method most widely employed to model rational learning 
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has been to employ Bayes’ rule. Bayes’ rule is the basic property of con¬ 
ditional probability and is a method of updating our belief or probability 
of event or hypothesis A given new evidence B. Specifically, our poste¬ 
rior belief P(A/B) is calculated by multiplying our prior belief P(A) by 
the likelihood P(B/A) that B will occur if A is true. To see this we can 
rearrange the conditional probability formula to get: 


P(A/B)P{B) = P(A,B) 

where P(A,B ) is the joint probability of A and B. 
By symmetry we also have 

P{B/A)P(A) = P(A,B) 

It follows from (250) and (251) that . 

P(B/A).P(A) 


P(A/B) = 


P(B) 


(251) 


(252) 


(253) 


Equation (252) is called Bayes’ rule or Bayes’ theorem. The formulation 
carries the implication that beliefs change by learning: agents come to 
know of a new fact and form their posterior belief by conditioning their 
prior belief on these facts. 

An alternative form of Bayes’ rule is given by 


P(A/B) = 


P(B/A).P(A) 

J2P(B/Ai).P(Ai) 


(254) 


since P{B ) = ^P(B / Ai).P(Ai) where A,: refers to the event space. 

i 

From (252) we observe that the data or new facts B only influence the 
posterior inference through the function or probability P(B/A) which is 
called the likelihood function. 

The ratio of the posterior probability evaluated at the two events Ai 
and A 2 is called the posterior odds. 

We have 


P{Ai/B) PjA^PiB/A^/PjB) P(Ai)P(B/Ai) 

P(A 2 /B) P(A 2 )P(B/A 2 )/P(B) P(A 2 )P(B/A 2 ) 1 ; 

In words, the posterior odds are equal to the prior odds multiplied 
by the likelihood ratio. 

Bayes’ rule has been widely employed to model learning in the eco¬ 
nomics literature — see e.g. Cyert and DeGroot (1974), Backus and Drif- 
fill (1985), Ellison and Valla (2000), Lewis (1998) Sill and Wrase (1999), 
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Townsend (1978). Types of problem analyzed employing Bayes’rule in¬ 
clude learning about a new regime. We will give one example which is a 
slight modification of Lewis (1988). She demonstrates how beliefs that 
a policy process may have switched can induce apparent ex-post biased 
forecasts of exchange rates even after the switch has occurred. In addi¬ 
tion, in her model, exchange rates may appear to contain a speculative 
bubble component since they will systematically deviate from the levels 
implied by observing fundamentals ex-post. 

Assume the reduced form for the exchange rate (see chapter 14) is 
given by 

s t = m t + a(E t s t+ 1 - s t ) (256) 

where St is the money supply at time t, St is the exchange rate and a is 
a positive constant. 

Assume the money supply process is given by 

m t =9 0 +£t (257) 

where 9q is a constant and is a normally distributed random variable 
with mean zero and variance CTg. 

Suppose at a particular point in time, say t = 0, agents come to 
believe that the money supply process may have changed, due to an 
exogenous process such as a change in government or a statement by 
officials. The new process has for simplicity the same form as the old 
except with a different mean and variance: 

m t = + e\ for t > 0 (258) 

It is assumed that 9\ < 9 o and 9\ = 0, so that the process can be 
interpreted as going from ‘loose’ to ‘tight’ money. Agents are not sure 
which money supply process is in operation. It is further assumed for 
simplicity that agents believe that if policy has changed it will not be 
changed back and they also know the parameters of the potential new 
process. 

Solving (29) forward we obtain the solution 

OO 

* = (1-7) E l %E t m t+i (259) 

i=0 

where 7 = iq^- 

Expected money supply given the assumptions above is equal to 

E t m t +i = #o(l — Pit) for any i > 0,t > 0 (260) 

where Pit is agents’ assessed probability at time t that the process 
changed at time 0 . 
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Consequently the exchange rate is given by 


st = (1 - i)mt +7(1 - PitWo (261) 


(note EtSt+i = 9q{\ — Pit) from (30) and (31) since = 1+ 7 1 -H 7 2 + 
- + 7 00 and = 1 - 7 ) 

To obtain the best estimate of Pit, agents combine their prior beliefs 
about the probability together with their observations of money out¬ 
comes each period to update their posterior probabilities according to 
Bayes’ Rule. 


Pit-M/Oi) 

Pit-if(It/0i) + Pot-if(It/0 o ) 


(262) 


where Po t is the probability of no change at t = 0 , /( I t /6 J is the prob¬ 
ability of observing the information set It given that nit follows the ?'th 
process. The posterior probabilities of each process, the posterior odds 
ratio, is given by 


Pit 

' Pit-if(m t /9iY 


[Pit- 1 ] 

Pot 

_Pot-if(m t /9 0 )_ 


Pot-i. 


(263) 


The first term on the right-hand side of (262) shows that the change 
from t — 1 to t in the relative conditional probabilities depends upon 
the observation of the current money supply at time t. For example for 
some observation of current money supply, say m, the probability of be¬ 
ing under either regime is equal, so that the posterior probabilities,^, 

p 

are equal to the prior probabilities p 1 * -1 ■ Observations of money sup¬ 
ply different from m convey information about the regimes causing the 
probabilities to be revised. The last term on the right-hand side of 
(262) quantifies this information. For a normally distributed error the 
sampling distribution for a single scalar observation, m, from a normal 
distribution parameterized by a mean 9 and a variance a 2 is given by 

{m-ej 2 

P{m/9) = 6 JT (264) 

V 27T<7 

Clearly the information accruing from an observation will depend on 
the particular distribution of the error term. 

If for simplicity we assume that cto = oq = a then we can write (36) 
as 


ln( ■ 


It N 

V 


= In 


lt-i 

D 

0£—1 


+ In 


f(rnt/0i) 

f(rnt/9 0 ) 


(265) 
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so that for any realization of the money process m k 

f(m k /0i) _ (mfc - dp) 2 - m 2 k 
11 f(mk/0o) _ 2 a 2 


(266) 


Substituting (265) into (266) we obtain a linear difference equation 
that can be solved as 


1 /-Pi U , Pl,0 

ln( p - ) = In tt 

M)t -M ),0 
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fc=i 


(m fe - 6 0 ) 2 -m 2 k 


21 


2 a 2 


(267) 


In equation (266) the evolution of probabilities is seen to depend on 
the actual observations of the process. When the money supply observed 
today is very negative agents will think it more likely that policy has 
changed and vice versa. 

Taking expectations of (266), defining 6i as the true 6 gives 


MM 


1 P bQ , , 
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2a 2 

'61 - 26,60] 
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(268) 

(269) 


Equation (268) illustrates that the expected value of the ‘true’ process 
rises over time. For instance when policy has changed so 6i = 6\ = 
0 , from (268) we observe that the log probability increases to infinity 
because of the trend term ^ (or that Pot goes to zero). When policy 
has not changed so that 6j = 6 0 the reverse is true due to the trend 

_^ q2 

term -^pr- Consequently agents’ assessed probabilities of a policy change 
are random variables determined by random observations of the money 
supply. 

Given this analysis of the evolution of probabilities Lewis is able to 
investigate the effects on the exchange rate and forecast errors.Taking 
expectations at t — 1 of (260) and subtracting from (260) we obtain the 
forecast errors corresponding to each potential process as: 

s t - E t _ lSt = (1 - 7)4 + 6 0 (Pi,t-i - 7-Pit) if 6i = 6 0 (270) 

and 


s t - E t _ lSt = (1 - 7)4 - 6 0 (Po,t-i ~ 7-Pot) if 6i = 6i (271) 


(note E t _iPi t — Pi,t-ii E t _iPo t — Po,t-i)- 

Equations (269) and (270) illustrate that whilst agents are learning, 
expectation errors will exhibit a systematic component and ‘appear non- 
rational ex-posV. 
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Taking expectations of the forecast errors in (270) conditional upon 
a change in policy to 9\ the expected evolution of the forecast errors, for 
a large number of mk is given by 

E(s t - Et^at/di) = -0o[£?(Po,t-i/0i) - 'yE(P ot /9 1 )} < 0 (272) 

The inequality is negative since 7 is less than one and EPo t /9i < 

£?(P o> t-i/0i). 

From (271) we observe that if agents do not fully realize that policy 
has changed the exchange will be expected to be weaker than subse¬ 
quently occurs. Ex-post expectations will appear to be irrational. 

Lewis’ model nicely illustrates how learning about a regime change 
using Bayes’ rule can mimic the outcomes of the Peso problem discussed 
earlier in this chapter. 


CONCLUSIONS 

In this chapter we have discussed some of the implications of the propo¬ 
sition that financial markets are efficient. As particular applications of 
the proposition we examined the behaviour of stocks, exchange rates 
and bond markets. The theoretical work on efficiency in asset markets 
is large and is expanding dramatically so that we have only been able 
to provide a flavour of the debate. Whilst the new generation of mod¬ 
els which stress micro-structure arguments appear of great interest, our 
own view is that the empirical evidence for (approximate) semi-strong 
efficiency in the capital market is sufficiently powerful and convincing for 
it to be regarded as a ‘stylized fact’. In fact it is now standard practice 
for macroeconomic model builders to simulate or forecast the impact of 
changes in government policy within models that assume capital market 
efficiency. 
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APPENDIX 14A1 - INCOMPLETE CURRENT INFOR¬ 
MATION 

The purpose of this appendix is to show the manner in which the stan¬ 
dard test of efficiency based on equation (81) of this chapter has to be 
modified if agents have an incomplete current information set. For ana¬ 
lytical simplicity (though the argument is quite general), we assume the 
series yt is the summation of two infinite moving average error processes 
in the two white noise errors, e and z. 


yt = y + y, met-i + ^ (l) 

2=0 2=0 

.where y is the mean of the series, and the 7iy and <5* are constant coeffi¬ 
cients. 

Consider the rational expectation of yt+i formed at time t. If there 
is full current information at time t the expectation will be given by: 

oo oo 

EtVt+i — y + Tti et-i+i + SiZt-i+i ( 2 ) 

2=0 2=0 

In this case, given the one-period forecast horizon, the ex-post fore¬ 
cast error will be given by a white noise error: 


Ut +1 - E t y t+ 1 = 7r 0 e t+ i + 6 0 z t+ i (3) 

Consequently the standard tests based on (81) are correct in these 
circumstances. 

Suppose next that agents have incomplete current information at 
time t, and instead observe some current global information (for instance 
via asset markets), but other global information with a one-period lag. In 
particular we will assume for simplicity (though the argument is easily 
generalized) that there is one global indicator (say the interest rate) 
which is given the representation: 

OO 

Rt = T + + h iZ t-i) (4) 

i=0 

where r is the mean of the series and the d,; and hi are constant coeffi¬ 
cients. 

In this incomplete current information case the one-period-ahead ex¬ 
pectation of yt is given by: 

OO OO 

Etyt +1 = y + 7Tl E t et + TTi^t-i +1 + S\E t Zt + SiZt-i+l (5) 

2=2 2=2 
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Consequently the forecast error is given by: 

Dt+ 1 ~ E t y t+ 1 = TToCt+i + bo z t+i + 7T i [et — E t e t \ + 8\ [z t — E t z t ] (6) 

Given current observation of the global indicator and using the 
usual signal extraction formulae (as discussed in chapter 3) we obtain: 

Et^t = + ho Zt) (T) 


E t z t = 


t - (f <t>e){do e t + h 0 Zt) 
<i o 


( 8 ) 


where 


dla 2 


die 


hie 


and cr 2 and a 2 are the variances of the two errors, e and 2 respectively. 
Consequently the forecast error (K t +\) is given by: 


Kt+i — TroQ+i + 8oZt+l + T 1 "! 


(1 - e/) e )e t - 


+ 


e z t - - <Pe)tt 


(9) 


Serial Correlation of Forecast Errors 


If we take expectations of two successive errors we find: 

E(K t+ 1, K t ) — | ^ 2(7 2 !|_ hifj'2 | — dido].[noho — <5odo] (10) 

Consequently, in general, incomplete current information will give 
rise to a moving-average error process. This will not be the case if, 
first, we have implicitly full current information (for example, if there 
are as many global indicators in the economy as random shocks; see 
Kami, 1980), or if, secondly, we observe the current value of the variable 
to be forecast (it being itself a global indicator). In this latter case 
7i"o = do, So = ho (also ni = d\, 8\ =h\) and the expected correlation 
in (10) is equal to zero. 

It would appear from this result that standard tests of efficiency based 
on (81) will have the usual properties for asset prices in particular (which 
it can be assumed are observed currently), even under incomplete cur¬ 
rent information. However, the general point remains that variables not 
currently observed (i.e. the majority) will under incomplete information 
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be inappropriately tested for efficiency by these methods. Furthermore, 
one needs to scrutinize carefully the assumption that the asset prices in 
question are contemporaneously observed. In very high frequency data 
(e.g. hourly) this will obviously not be so except for a few continuously- 
broadcast asset prices; it will also not be so in lower frequency data 
for averages of variables (e.g. the level of all short-term interest rates), 
which are often examined in these studies. 

In general, in circumstances of incomplete current information the 
moving-average error in equation (9) will be given by s + j — 1, where 
s is the time horizon of the forecast and j is the longest lag on global 
information relevant for forecasting y t . Clearly there may be some a 
priori doubt as to the magnitude of j, which may cause some problems 
in interpretation of tests based on (81). As a consequence of the moving- 
average error process, least squares estimates of (81) under incomplete 
current information will be inefficient but unbiased, since 

E{Kt+ 1 , E(y t+ 1 )) = 0 (11) 

and least squares estimates have the property of unbiasedness even in 
the presence of moving-average error processes. This situation is the 
same as that of overlapping information in the usually assumed case of 
full current information; overlapping information here occurs with s > 1, 
familiarly introducing a moving-average process with the same effects. 

These results have potential implications for a number of empiri¬ 
cal studies (see e.g. Holden and Peel, 1977; Turnovsky, 1970) in which 
an implicit assumption of full current information has been made when 
studying directly-observed consumer price expectations data which can¬ 
not readily be assumed to be part of the current information set. The 
point here is that price data are not currently observable on any rea¬ 
sonable assumptions. Consequently, the ‘expectations errors’ should be 
serially correlated, as indeed has often been found in these tests. It is 
possible that these survey data may well reveal rationality after all. For 
further implications of partial current information sets for the testing of 
efficiency, see Minford and Peel (1984). 
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APPENDIX 14A2 COMPOSITE MOVING-AVERAGE 
ERROR PROCESSES 

Testing for the efficiency of relationships involving moving-average errors 
poses problems. We wish to form a moving-average error process from 
the composite error process 

av t+ i + bv t + cu t+ 1 + du t (1) 

where v t +i and u t+l (i = 1 , 0) are serially uncorrelated random variables. 
We define the new moving-average process 

<f>t+i + j<t> t = av t+ i + bv t + cu t+1 + du t (2) 

where <f) t+i (i = 1, 0) is a serially uncorrelated random variable. 

The method is to equate the ratio of the variance to the covariance 
of the error processes on the left- and right-hand sides of (2) (where the 
variances and covariances of the left- and right-hand sides are equal by 
definition — a similar method is employed for higher-order composite 
processes). 

We obtain that 

(l+j 2 )cr| _ (a 2 + b 2 )al + (c 2 + d 2 )a 2 u + 2(bd + ac)cov(uv) 
ja ^ after 2 + cda 2 + (ad + bc)cov(uv) 

where <r 2 , <r 2 , a 2 are the variances of cp, v, u and cov(ui>) is the covariance 
between u and v. 

Equation (3) is a quadratic equation in j (note a ^ cancels). The 
root of the equation which has modulus less than unity is chosen so 
that the process can be stationary. Clearly the magnitude of j will 
reflect relative magnitudes of variances and covariances. For example 
if a = 1, b = —1, c = 1 and d = 0, er 2 = <r 2 and cov(uv) = 0 then 
■ _ —3+5 0,5 
J 2 

Using the lag operator it is also useful to note that we can express 
<t>t+i as 


[av t +i + bv t + cu t+ i + dut 
(1 +jL) 

[avt+i + bvt + cut+i + dut]{ 1 — jL + j 2 L~ — j 3 L 3 + ...] (4) 

From (4) we observe that 4> t+1 can be expressed as an infinite sum 
of the past errors v and u. This is important in some tests of efficiency 
under rational expectations. Since the errors in the composite process 
are in principle any errors in the economy if a moving-average error 
process is estimated jointly with the other parameters of the model it 
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will induce correlations between the regressors and the error term which 
will result in biased and inconsistent parameter estimates. For instance 
consider the two-period ahead rational expectations forecast error for 
the process, 


We have that 


oo oo 

yt = v+^2 6iVt ~ i 

i =0 z=0 


(5) 


Vt = Et- 2 Vt + 7o u t + 7i«i-i + fioVt + SiVt -1 (6) 

Although the error term is a moving-average process, in a test of 
efficiency it would be inappropriate to estimate jointly the parameters 
of the model and the moving-average process 

Vt = otQ + a\E t -2yt +</>*+ j<t>t- i (7) 

and test that «o— 0, «i= 1. As shown above the error term can be writ¬ 
ten as an infinite summation of previous errors. This is also a property 
of the forecast so that there will be correlation between the error and 
the explanatory variable. An appropriate procedure is to estimate (7) 
by least squares and employ standard errors which are modified to allow 
for the serial correlation in the error term (see e.g. Hansen and Hodrick, 
1980). 

It is also useful to note that an ARMA forecast of a variable, even 
assuming the underlying model is linear, is less efficient than a rational 
expectations forecast if the reduced form of a variable includes com¬ 
posite moving-average errors. Suppose for illustration a variable, y t , is 
generated by the process 

y t = u t - u t - 1 + v t (8) 

The rational one-period ahead forecast, E t -\yt = —u t -i- Let the 
moving-average process for (8) be given by y t = <f> t — j(p t _ \ (, j > 0) so 
that the one-period ARMA forecast is Ef_ t y t = The associated 

forecast errors are 


y t - E t -iy t = u t +v t 


(9) 


for the rational expectation and for the ARMA forecast 


yt - K-iVt 


lit - u t -1 + v t 
(1 -jL) 


( 10 ) 


Assuming for simplicity that cov(uv) is zero the variance of the ra¬ 
tional expectations forecast is 
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The ARMA forecast error can be written as 


Vt - Ef_ x y t = u t - (1 - j)u t -1 -j{l- j)u t -2 - j 2 ( 1 - j)ut ~3 + • • • 

+ Vt+ jvt-l + j 2 V t - 2 + ■ • ■ (12) 

(Recall that = 1 + jL + j 2 L 2 + j 3 L 3 + ...) 

Although the ARMA forecast error appears serially correlated this is 
in fact not the case as substitution for j in terms of the variances of u 
and v, though messy, will demonstrate. 

The variance of the ARMA forecast error is given by 


a 2 u + a 2 u (l-j) 2 [l+j 2 +j 4 +f+ ...} + 

^{i+.r-/'+/-•••} (13) 


which we can simplify as 




+ 


(1 -f) (1 -j 2 ) 


(14) 


Because j is less than one the variance of the ARMA forecast is 
greater than the rational expectations forecast error. Essentially infor¬ 
mation is lost in forecasting the composite error process. In addition the 
innovation from the ARMA process, <j> t , though serially uncorrelated, 
will not necessarily be orthogonal to variables that are correlated with 
past u and v innovations, due to the implicit dependence of the ARMA 
innovation on these variables. 



15 

Interpreting the Evidence: The 
Problem of Observational 
Equivalence 


As was shown in chapter 4, it is a key implication of an economy embody¬ 
ing a Sargent-Wallace supply curve, rational expectations and identical 
information sets of public and private agents that, from a stabilization 
perspective, output (or the unemployment rate) will be invariant to an¬ 
ticipated rates of monetary change. Only unanticipated changes in the 
money stock will have any impact on output. This appears to suggest 
an attractive test of such a New Classical model; whether it is antici¬ 
pated or unanticipated changes in the money supply that affect output. 
Barro(1977) in a seminal paper attempted to determine empirically the 
relative impacts of unanticipated and actual rates of monetary change 
on the unemployment rate in the USA over the period 1946-73. Since his 
initial paper, he and others have carried out similar tests for a number 
of other countries, as well as repeating the US tests in different ways. 

Barro’s empirical procedure involved two stages. First, a policy reac¬ 
tion function was determined for the authorities. This involved explain¬ 
ing the actual rates of change of the money stock in terms of variables 
previously known to the authorities. The residuals, or unexplained part 
of monetary change, from this reaction function are interpreted as the 
unanticipated rate of monetary change. 

Thus if the reaction function of the authorities is given by: 

Up : Um-V—1 T Umt (1) 

where m is the rate of monetary expansion, X is a vector of variables, 
a m a vector of coefficients, u m t is a random residual. Then 


m t - Et-mit = Umt 


( 2 ) 
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Having obtained estimates of the monetary innovations, Barro pro¬ 
ceeds to use these innovations, current and lagged (along with other 
postulated determinants of the natural rate of unemployment such as 
a minimum wage variable and a measure of military conscription), to 
explain the rate of unemployment. 

Barro’s empirical results appear impressive. They suggest that the 
current and previous two years’ monetary innovations are significant 
determinants of the rate of unemployment with the correct negative sign. 
Moreover, when Barro adds previous rates of change of the money supply 
to his unemployment equation, they are found not to be statistically 
significant. 

The basic type of analysis conducted by Barro has been replicated 
by a large number of different authors for a number of different coun¬ 
tries (see e.g. Barroand Rush, 1980; Leiderman, 1980; Attfielcl et ah, 
1981a, b; Mishkin, 1982). The results reported by all these authors ex¬ 
cept Mishkin seem to suggest that real variables, either unemployment 
or output, are responsive only to unanticipated rates of monetary change 
and not the actual rate. Consequently, it would appear from this em¬ 
pirical evidence that there is powerful support for the joint hypothesis 
of rational expectations and independence of unemployment or output 
from anticipated rates of monetary change. 

Sargent (1976b) in an important paper, showed that this interpreta¬ 
tion of the empirical evidence may be incorrect. Sargent demonstrated 
that the reduced form for output or unemployment in a Keynesian model, 
in which systematic monetary policy will influence the variance of out¬ 
put, may be statistically indistinguishable from that of a classical model 
in which only unanticipated movements in the money stock impact on 
output or unemployment. The Keynesian and New Classical models are 
said to be observationally equivalent in their reduced forms. 

Sargent’s main point can be made most simply by consideration of 
the following model. Suppose the ‘true’ reduced-form equations for an 
economy are given by the equations: 

Ut = a c X t _i + b c (m t - E t ^nn t ) + u ct (3) 

and the money supply reaction function is (1) above. Then £ ( _im ( = 
a m X t -1 and we can rewrite (3) as: 

Vt = (a c - b c a m )X t -i + b c m t + u ct (4) 

which has all the appearance of a Keynesian model. 

Alternatively take a Keynesian model to be the true one: 


Vt — dkXt -1 + bkTrit + Ukt 


( 5 ) 
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By adding and subtracting expected money supply we can change it 
into: 

Ut = (ak + bka m )Xt -1 + bk(m t - E t -im t ) + u kt (6) 

an apparently New Classical model. 

It is obvious from Sargent’s analysis that ‘unrestricted’ single equa¬ 
tion reduced forms for output have no information content, at least with 
respect to the maimer in which changes in the money supply impact on 
an economy. A simple regression of output on the money supply simply 
cannot inform us of the true nature of the impact of monetary change 
on these variables. 

The point that Sargent is making is that two quite different types 
of model, Keynesian and New Classical, have indistinguishable reduced 
forms for output in terms of either actual or unanticipated money stock. 
This does not imply that the models in toto are not distinguishable; 
clearly they are, on the basis of a variety of tests on structural coeffi¬ 
cients and also on reduced-form equations for more variables than simply 
output. Therefore the point is a narrow but important one, related to 
this particular test carried out by economists (remember that there has 
been great interest in money-supply/output relationships ever since the 
debate over Friedman and Meiselman, 1965). 1 

In particular, it needs to be stressed, because there has been some 
confusion on the point, that Sargent’s point is not about ‘identification’ 
as such. Both the Keynesian and New Classical models may be fully 
identified; that is, the parameters of each may be individually retrieved 
by estimation of the full model (i.e. subject to all its restrictions). How¬ 
ever, there is a useful potential connection with the concept of identifi¬ 
cation. If two models can be ‘nested’ in a more general model (usually, 
a linear combination of the two), then, provided the coefficients of each 
model can be identfied in this general model, it is possible to test for 
their significance and accordingly that of each model. In this situation, 
if (and only if) the coefficients cannot be identified, the models will be 
‘observationally equivalent’. 

Sargent’s point has provoked a lively literature attempting to es¬ 
tablish conditions under which a limited number of reduced forms can 


1 The analogous point also turns out to be applicable to reduced forms for the 
exchange rate (see chapter 14, Box 14.1) as a means of distinguishing ‘equilibrium’ 
from ‘disequilibrium’ open economy models. It will also often be applicable when 
a reduced form on any single variable is appealed to as a means of distinguishing 
equilibrium from disequilibrium models; appeals to the ‘stylized facts’ about any 
single variable have accordingly to be treated with the greatest caution. 
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distinguish between the two models. These attempts have centred on 
identification within such a general model, in this case one which nests 
two distinct reduced forms. 


IDENTIFICATION 

In order to discuss this issue more fully it is necessary first to set out the 
basic issues which arise in indentification of an economic system. (For a 
comprehensive outline the reader should consult a standard econometric 
text such as Johnston and DiNardo, 1997). 

Suppose that the demand and supply for a commodity is given by: 

qt = aupt + a V2 yt (demand) (7) 

qt = a 2 ip t + a 22 z t (supply) (8) 

where the as are constants, p is price, q quantity and y and 2 are exoge¬ 
nous variables which affect demand and supply respectively. Demand 
and supply are assumed to be equal. Random disturbances and con¬ 
stants are omitted from (7) and (8) for simplicity. The equations are 
treated here as structural or behavioural equations and the as as struc¬ 
tural coefficients. The issue of identification is: given that these two 
equations are both operative, can each be confused with the other? To 
put it another way, is it possible to distinguish between each equation 
and a linear combination of the two equations together? 

Form a linear combination of the two equations: 

m + (1 - p)qt = qt = [/win + (! - p)a 2 i]p t + pa 12 y t + (1 - p)a 22 z t 

(9) 

Plainly the demand equation is different from it because it has no 
term in zp, similarly the supply equation because it has no term in y t . 
So if we fit these two equations to the data, we will get estimates of the 
four as. (We would still want to worry about the best way to estimate 
them, given simultaneous equation bias; but that is another matter — 
concerning the technique of estimation) 

Now suppose instead that the supply equation was: 

qt = a 21 p t + a 22 y t (supply) (10) 

Our linear combination of the two equations would then be: 

qt = [pail + (1 - y)a 2 i]p t + \pa 12t + (1 - y)a 22 \y t (11) 
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It is clear from inspection that now neither the demand nor the supply 
equation can be distinguished from this. When you estimate the demand 
equation you could be picking up the coefficients of the supply equation 
or the demand equation or any combination of the two; it is impossible to 
know what you have estimated across this set of possibilities. The same 
applies when you estimate the supply equation. Notice that it matters 
not what technique of estimation you use (TSLS, FIML, indirect via the 
reduced form); the point is you just do not know what you are picking 
up, however sophisticated your estimation method and however much 
data you have. We say that neither equation is identified. 

Now suppose the supply equation was: 

Qt = a 2 iPt + 0,22 z t + a 2 m (12) 

Our linear combination would be: 

qt = [l-mi + (1 - p)a 2 i]p t + [pa 12t + (1 - p)a 22 ]yt + (1 - p)a 2 3Zt (13) 

It is now clear that the demand equation is distinct from this (and so 
identified) while the supply equation is not (and so unidentified). What 
has enabled an equation, in this case the demand one, to be identified is 
a restriction it imposes; viz in this case the exclusion restriction on one 
variable. 

In fact we can generalise this analysis to other restrictions, notably 
cross-equation restrictions on parameters. For example suppose that the 
coefficient an = (I 21 , then plainly we could always be sure to estimate 
On, whatever else we could not be sure of, since in the linear combina¬ 
tion the coefficient on p t would never be anything else. What this also 
illustrates is that even if equations are not identified, coefficients can be. 
In general we can look at which coeffcieints in a system of equations are 
identified. 

Once we introduce this idea of restrictions of any sort, we can also 
find that a coefficient is ‘over-identified’ by which is meant that the 
restrictions can clash in estimating the coefficient. Suppose we go back 
to our original model where each equation was identified by excluding 
a variable. Now super-impose the cross-equation restriction that an = 
021 ] you can immediately see that estimating each equation will provide 
a separate, valid estimate of an. This coefficient is thus over-identified. 
This is not however a problem; 011 the contrary it is rather an embarras 
de richesses, since plainly having two ways of tying the coefficient down 
means there is twice as much information about it in the data. When 
one uses a simultaneous equation estimator, one chooses the value that 
gives the best fit for both equations taken together; this estimator thus 
makes use of all the information in the data to estimate the coefficient. 
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From this it can be seen that the more restrictions we can impose from 
our theory, the more over-identification of coefficients is achieved and 
the more information from any given set of data is brought to bear on 
the coefficients of the model. 


IDENTIFICATION OF KEYNESIAN AND CLASSICAL 
REDUCED-FORM COEFFICIENTS 

We now return to the problem raised by Sargent. Let us initially suppose 
that there are no lagged effects; the salient points can be made for this 
case. We know that a New Classical model can be written in the ‘semi- 
reduced’ form (semi because expectations have not been eliminated): 

Ut = a c X t -1 + b c (m t - + u ct (14) 

and that a Keynesian model will have a reduced form: 


Dt = a k X t -1 -1- b k m t + Ukt (15) 

where the us are the error terms which we can in practice ignore. 

Now suppose that the rest of the model is a policy reaction function 
for the money supply: 

TTlt &mXt— 1 T U m f (16) 

where u m t is a random error independent of u ct and u k t which again we 
can ignore. (Much of the discussion which follows is based on Buiter 
(1983), to which the reader is referred for further analysis.) 

Sargent’s point can now be expressed as the impossibility of identify¬ 
ing b c and b k in this model. As we saw above, since E t -imt = a m X t - 1 , 
substituting into the New Classical model gives us 


Vt = (a c - b c a m )X t -1 + b c m t + u ct (17) 

from which it is plain that the New Classical and Keynesian models 
are unidentified, in other words indistinguishable from each other and a 
linear combination of the two. Similarly, we also showed we could make 
the Keynesian one look New Classical by rewriting it: 

Vt = ( a k + b k a m )X t -1 + b k (?n t - E t ~im t ) + u k t (18) 

We can equivalently write down a linear combination, /z times New 
Classical and 1 — /z times Keynesian; and check that indeed we cannot 
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estimate the parameters in it, or infer the value of n - as lack of identi¬ 
fication plainly implies. This is a useful alternative way of approaching 
the problem. Combining the two equations yields a composite equation: 

Ut = [fj>a c + (1 - fj,)a,k\x t -1 + iib c (m t - E t -imt) + (1 - t^hm 

+ [liud + (1 - y)ukt] 

= aX t _i + b(m t - Et-imt) + cm t + u t (19) 

In this we would like to be able to identify b and c; since we expect 
one or other to be zero, that would in turn identify //. However, as we 
have seen, the substitution of a rn X t -\ for E t --\rrit in the above shows 
that we will estimate 

Vt = (a- ba m )X t _ 1 + {b + c)m t + u t (20) 

enabling none of the coefficients, a, b or c, to be estimated, or by impli¬ 
cation fJL. 

To identify the crucial bi(i = c, k) parameters, we must set one of 
the elements in cq to zero (if X is a single variable, a; = 0 is necessary); 
that is, at least one variable appearing in the monetary policy reaction 
function must not appear in the output equation. Suppose for instance, 


that our model is given by: 

Vt = a c X i t _i + b c (m t - E t -im t ) + u ct (21) 

Vt = akX it_i + b k m t + u kt (22) 

mt — Q'm\X\t— i T d rn 2 ^k 2 t —1 T Unit (^^) 

then the first two equations are distinct. We can substitute for E t ^m t 
in the New Classical equation, for example, to get: 

Vt = ( a c - b c a m i)X lt _i + b c m t - b c a m2 X 2 t-i + u ct (24) 

which differs from the Keynesian one by the term in X 2t -i. In terms of 
our linear combination, we would have 

Vt = {a — ba ml )X u _i + {b + c)m t - ba m2 X 2t _i + u t (25) 


Given that we know the a m i(i = 1, 2) from estimating the money supply 
equation, we can estimate b from the coefficient on X 2t _\ and so c and 
a from the other two. 

What this all amounts to is that the expectations term Et-imt im¬ 
plicitly contains in it other variables known at time t — 1. If these vari¬ 
ables also appear in the output equations then both these equations are 
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indistinguishable because there is no way of telling the effect of expec¬ 
tations in the New Classical from the direct effect in both equations. 
But if the output equations exclude at least one of these variables that 
go to form expected money supply, they can be distinguished. Here an 
exclusion restriction achieves identification. 

This, however, is hard to visualize in any model, because the reduced 
form of output will generally contain all the lagged exogenous variables 
of the model which are candidates for the money supply reaction func¬ 
tion. Barro, in his work on the USA, assumes with this in mind that 
government expenditure enters the money supply function but not the 
output function. This is implausible, given that government expenditure 
appears in the GDP identity. It might be possible to develop the New 
Classical model to exclude all such lagged exogenous variables, on the 
grounds that output depends only on current shocks to supply and to 
money. Go back to our original model, for example, but exclude X t ~\ 


from the New Classical: 

y t = b c (m t - Et^rrit) + u ct (26) 

Ht = a k X t - 1 + b k m t + Ukt (27) 

Tfit — ci rn Xf_i T u m t (28) 

We can now rewrite the New Classical as: 

yt — b r rrif b c a m X t _i T u c t (29) 


which can be distinguished from the Keynesian by a cross-equation re¬ 
striction - viz. if the coefficient on X t -i equals minus that on rn* times 
dm, then the model is New Classical. 

The difficulty with this is that the New Classical model supply curve 
also has the intertemporal substitution effect from real interest rates (and 
in open economy models the analogous effect from the real exchange 
rate); the real interest rate (or the real exchange rate) is affected by 
all the exogenous variables in the model. Those using this test must 
(and in practice do) assume that these effects are negligible. But by so 
doing they restrict ‘New Classical’ to a limited class of model within the 
broader New Classical type; furthermore, some (intertemporal or other) 
substitution is a necessary component of the New Classical supply curve, 
otherwise no Phillips curve is possible. We also saw that there is scope 
for fiscal stabilization through government spending and (when there are 
wealth effects of government bonds) also for monetary stabilization in 
models with these effects. If Barro and his disciples were to demonstrate 
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that the limited New Classical model held, they would be undermining 
the foundations of the New Classical model. 

There remains the issue of partial macro information and signal ex¬ 
traction. This issue is assumed away in Barro’s testing framework: none 
of the current money supply shock is assumed to be inferred from current 
information such as local prices, interest rates and exchange rates. 

Yet again such signal extraction is at the core of the New Classical 
supply curve, as set out for example in Lucas’ ‘islands’ model (Lucas, 
1972, 1973). If included, signal extraction implies that 

E t m t = E t -imt + (( f> 0 u mt + (f>iu ct ) (30) 

where and (/> 1 are linear combinations of structural parameters and 
error variances. 

It can be seen that if the New Classical model has the exclusion 
restriction in it (on real interest rates and real exchange rate effects), then 
it is still possible to distinguish the two models via the cross-equation 
restriction discussed above. Nevertheless, the policy implications are 
different if there is signal extraction: stabilization policy may become 
effective, as discussed in chapter 4. The policy aspects of the Barro tests 
must therefore be approached with caution (this has also been stressed 
by Buiter, 1983). 

The effectiveness of the test proposed by Barro really boils down, 
therefore, to the acceptability of the exclusion restriction he must make 
to allow the test to discriminate. With the exclusion restriction, the 
‘New Classical’ model appears to be vitally impoverished: but without 
it the test is no use. 

Sargent has suggested that models such as (14) and (15), which are 
as they stand observationally equivalent, may be identified if a struc¬ 
tural break is known to have occurred in the policy regime during the 
sample period. If the world is New Classical, then the output coefficient 
on X t _iin the regression on m t and X t _i will be invariant to a change in 
the policy regime (when a m changes). Consequently tests for stability of 
such an equation across the samples before and after the break will dis¬ 
criminate between Keynesian and New Classical models. Nevertheless, 
this is not helpful in general (when breaks in regime either do not occur 
or cannot be known to have occurred), and it has rarely been used. 

This discussion can be extended to include lags. Write our linear 
combination (19) and money supply equation (16) in general lagged form 
as: 


[1 - q{L)]y t = a{L)X t + [6 0 + b(L)\(m t - + [c 0 + c(L)]m t + u t 

(31) 
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[1 - r(L)]m t = a m (L)X t + u mt (32) 

where L is the lag operator and a(L), for example, is a polynomial in 
L (1, 2,...). It is convenient to substitute the error, u mt , for rn t — 
Et-im. t and 7n t = r{L)m t + a m (L)X t + u mt . The hats show that we 
have estimates of these from (32). 

So we rewrite (31) as: 

[1 — q(L)\y t = {u(T) + [co + c(L)] d m (L )} X t + [fro + b(L) + Co 

+ c(L)]u mt + [co + c{L )] [ r(L)]m t + u t (33) 

The b and c coefficients cannot be separately recovered. The cs will 
only be retrievable if at least one of the X t variables is excluded from 
(31) as before in the no-lag case (for example if X t is a single variable, 
then a(L) = 0 will suffice). 

Another possibility for discrimination when there are lags has been 
pointed out by McCallum (1979). This is the special case where the 
classical model contains only the current monetary innovation as a de¬ 
terminant of output. This case corresponds to having b{L) = 0 in (31). 
In this case we retrieve c(L) from the coefficients on lagged money sup¬ 
ply; for the model to be New Classical, lagged money supply should not 
enter the output reduced form. 

The exclusion of past monetary innovations from the output equa¬ 
tions may well not be valid; it places absolute reliance on the absence of 
a moving-average process on the term p t — E t -iPt entering the Sargent- 
Wallace supply curve. Yet it only requires a one-period delay in one 
sector’s observed response to such shocks to yield such a process; this 
surely cannot be ruled out, in view of decision and delivery lags. 

This suggests a final problem: the difficulty of discriminating be¬ 
tween a New Classical and a New Keynesian model with the Barro test, 
supposing that the Keynesian model can be rejected by it in some way. 
Barro’s original test in fact included the current and lagged money inno¬ 
vation in the output equation, using normal data. Yet this is consistent 
with a simple overlapping contract model of the sort illustrated in chap¬ 
ter 4, where both the current and lagged money shock affected output. 
More complex models such as Taylor (1979) add serial correlation to 
such moving averages of shocks, so that these shocks became an ARMA 
process. This is not obviously distinguishable from the processes Barro 
assumes. Only if an assumption is made, as in McCallum, about the 
New Classical error process, can the two be distinguished. Yet as we 
have seen, the basis for such an assumption is questionable, even with¬ 
out confronting the well-known empirical difficulties of testing between 
different error processes. 
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CONCLUSIONS 

The empirical evidence presented by Barro and others on the impact 
of unanticipated or actual money on real variables appears to give im¬ 
pressive support to the New Classical model, as set out in the Sargent- 
Wallace model of Chapter 2. 

Unfortunately, ingenious as Barro’s original idea was and little as 
one would wish to ignore any empirical evidence, there are difficulties in 
the interpretation of these tests: it is impossible to distinguish a New 
Classical from a Keynesian reduced form equation for output unless it is 
assumed that at least in the New Classical model output is unaffected 
by exogenous variables which affect the money supply. This is a difficult 
assumption for New Classical economists to make in view of the im¬ 
portance they attach to intertemporal substitution and/or other mech¬ 
anisms which induce relative price effects in the output supply curve. 
Various suggestions have been made to modify these tests but ultimately 
none has been successful in restoring credibility to this mass of empirical 
work. It should also be noted that even if the tests show a ‘New Classical’ 
model, that does not imply policy ineffectiveness because: (a) New Clas¬ 
sical models with signal extraction in general create policy effectiveness 
and cannot be distinguished from New Classical models without signal 
extraction; (b) a New Keynesian model with overlapping contracts can¬ 
not be distinguished by these tests from a ‘New Classical’ model. To test 
for the restrictions which together give the full New Classical properties, 
the reduced form test of Barro is therefore inadequate. We turn next to 
estimation of structural models and how they may provide tests of these 
restrictions. 
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Direct Tests of Rational 
Expectations 


So far we have looked at two indirect ways of testing the rational expec¬ 
tations hypothesis: the test for market efficiency (a joint hypothesis of 
equilibrium expected returns and rational expectations) and the Barro 
reduced form test for whether any money surprises affect output (a joint 
hypothesis that the model has a ‘Sargent-Wallace supply curve’ and 
rational expectations). We now consider the two direct tests of the ra¬ 
tional expectations hypothesis (REH): direct testing on whether surveys 
of expectations display the properties of rationality, and estimation of 
structural models with a test for the REH restriction. 


SURVEY DATA 

A number of surveys of industrialists and private consumers have been 
carried out over the years for inflation and for output. The best known 
are the Livingston series in the USA and Carlson and Parkin’s (1975) 
interpretation of the Confederation of British Industry survey in the UK: 
another series which has been examined (Visco, 1984) is for Italy, the 
Mondo Economico magazine (with Confindustria) surveys of wholesale 
and retail prices. 

These surveys have generally satisfied the unbiasedness property (a 
weak test of rationality since it is shared with most other expectations 
hypotheses, notably adaptive expectations) but not the efficiency prop¬ 
erty: that is, information available to the survey participants was not 
independent of the expectations errors. Some of them get close to effi¬ 
ciency: Holden et al. (1985, chapter 3) review a number of these survey 
studies. For example, Visco finds that, up to the 1973 oil crisis, the 
Italian survey inflation expectations were unbiased, serially uncorrelated 
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and efficiently incorporated all available information; bias and ineffi¬ 
ciency occur in the few sample years after the oil crisis but Visco argues 
that this reflected turbulence and learning. 

There are two problems with tests based on surveys. First, are those 
surveyed the key participants in the markets? The REH asserts that 
the ‘average participant has rational expectations’: this is an ‘as if’ 
proposition which implies that the price should behave as if those active 
in the market have rational expectations. ‘Those active in the market’ 
means agents ‘on the margin’, to whom a decision whether to buy or sell 
matters at the time. We are all on many occasions not really interested 
in a particular market: for example, we are not most of the time buying 
a house, but when we are we become interested in being well-informed. 
It is clearly impossible for a survey to select those market participants 
who are at each time active in the market. To the extent that the survey 
does not, however, some of those who answer will not be well-informed 
and should not have rational expectations. 

The second problem may be no less important: the veracity of par¬ 
ticipants’ answers. The theory of revealed preference indicates that we 
should look at people’s actions not their words to reveal their tastes or 
views. In answer to questions, either they may not wish to reveal their 
true tastes or views or they may not truly know them — not perhaps yet 
having gone through the decision process that would reveal it to them¬ 
selves, or perhaps not even understanding their implicit views having 
done so (rather like a bicyclist who cannot explain how he has stayed 
upright). 

For similar reasons, economists have paid little attention to surveys 
of whether firms maximize profits or consumers maximize utility. For 
example. Hall and Hitch (1939) and Wilson and Andrews (1951) found 
from questioning firms that they did not consider themselves to be set¬ 
ting prices in a manner consistent with profit maximization. Similarly, 
psychologists have found that some consumers are irrational (Akerlof 
and Yellen, 1987). Economists have preferred to use maximizing models 
in spite of this evidence because the models have performed well empir¬ 
ically. A possible reason for this is that most people take advice from 
experts (usually organised in an industry, such as the forecasting or in¬ 
vesting industry) when decisions are important to them; the REH does 
not suggest that rationality is a trivial attribute, easily acquired in the 
ordinary course of life. Rather it is put forward as a benchmark which 
people strive to attain and devote some resources to when necessary. 

This reveals that it is the empirical performance of a hypothesis — 
the capacity to predict (past or future) conditionally on exogenous inputs 
— that matters to economists, if it is also attractive theoretically (that 
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is, it is logical, tractable and based on non-absurd assumptions about 
behaviour). The REH has obviously exercised considerable theoretical 
appeal, as argued earlier, indeed throughout this book implicitly in one 
way or another. It remains then to consider the last main predictive 
test: structural estimation subject to the REH. 


STRUCTURAL MODELS WITH RATIONAL 
EXPECTATIONS 

The natural way to test the REH is in the context of a macro model. 
Estimate the structural coefficients of the model subject to the REH. 
One may then test the restriction of the REH against less restrictive 
alternatives, such as adaptive expectations or ARMA time series: this 
can be done by a likelihood ratio test if FIML is used or by an F-test 
within single-equation estimation. 

It will help to understand the estimation of a full structural model if 
we take a simple example. We consider first the question of identification. 
Write down an aggregate supply and demand model of the economy: 

Vt = ot(p t - Et^ipt) + /3f t + U s t (1) 

Vt = - p t ) + j(E t p t+ i - p t ) + u% (2) 

We assume that the goods market clears continuously. (1) is the 
supply curve, with surprise inflation and / representing the fundamental 
driving the natural rate of output; (2) is the demand curve, driven by real 
balances and expected inflation (proxying interest rates as in the Cagan 
equation used in chapter 2). We assume that the exogenous processes, 
monetary and real, follow random walks: 

m t = m t - 1 + v t (3) 

ft = ft- 1 + w t (4) 

To check identification we proceed as in chapter 15 and substitute for 
the expectations to find what restrictions they place on the estimation 
of the coefficients in the demand and supply equations. If we equate (1) 
and (2), take expectations at t — 1 and substitute for E t -\m t = m t ~i 
and Et-ift = ft- i, we obtain: 

13ft -1 = <pm t -i - 4>E t -ip t + 'fEt-ipt+i - lEt-m (5) 

This can be rearranged as: 
p _ - (3ft-! 

(0 + 7)(l-^- 1 ) 
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1 

(t> + 7 


B; 

i=0 


' + 7 




= m t -1 - -ft 


( 6 ) 


where R 1 is the expectations forward operator discussed in chapter 2. 
By implication, 
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Substituting for these into our demand and supply curves 
respectively: 

yields, 

Ut 

Oi/3 

= apt - amt -1 - — ft- i + 0ft + 

<P 

(8) 

Vt 

= {0 + i)m + 7 )p t - 1 rft + ut 

<t> 

(9) 


We can see that equation (8) has an exclusion restriction (it excludes 
rrit — notice that though (9) appears to exclude ft- 1 , since f t = ft-i+wt 
it includes it implicitly). But additionally there are restrictions across 
the parameters which we can summarise by saying that there are seven 
coefficients to be estimated for only four parameters. Each equation con¬ 
tains a set of restrictions across its coefficients that distinguishes it from 
the other and from a linear combination of the two. In short the model is 
heavily over identified, as a result of imposing the rational expectations 
constraint. (Under backward-looking expectations the model is identified 
by an exclusion restriction on each equation, provided the expectations 
terms do not smuggle the excluded variables in via the backward-looking 
formula). One can see from this example how demanding the model is 
of the data; that is to say, the parameters, though tightly constrained 
by the model’s over-identifying restrictions, must still fit the data. 

We now turn to estimation.This is not the place to describe fully 
the technical econometrics for estimating an RE model: this has already 
been done by others (e.g. Wallis, 1980; Wickens, 1982; McCallum, 1976) 
and the techniques are available in software packages. There are two 
methods: limited and full information. In limited information (LI), the 
expectations variables are replaced by proxies generated from a first 
stage regression on the full set or a subset of the model’s predetermined 
variables. In full information (FI), the expectations variables are the 
estimated model’s own predictions: a package to do this written by Fair 
and Parke (1989) is available, implementing Fair and Taylor’s (1983) 
procedure. More recently a robust algorithm to compute the full infor¬ 
mation maximum likelihood estimates for a large model together with 
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its bootstrap confidence limits has been developed by Minford and Webb 
(2002). The idea of this is to search over the parameter set in a sim¬ 
ple grid-search-cum-liill-climbing procedure until the likelihood of the 
resulting structural errors is at a maximum. We now briefly describe 
this method. 

Our models are in general nonlinear and so we represent them quite 
generally as 

Vt = f[yt,(L)yt,x t ,(L)x t ,y%,(L)ye-,n]+u t t = 1,2, ,...,n (10) 
Xt = A(L)xt+ €* (11) 


■y t is the vector of G endogenous variables, x t is that of the exogenous 
variables, assumed to follow linear univariate time-series processes. ( L ) 
is the lag series L, L 2 , T 3 , ..., L k (where k is the maximum lag length). 
yt is the vector of rational expectations projections of the model based 
on information at t (these include the ‘natural rate’ variables, which 
are the solution of the equilibrium relationships in the model; they are 
projections of expected equilibrium based on current information). 7r is 
the vector of parameters. u t is the vector of structural equation errors; 
that is to say, the errors when the endogenous variables on the right-hand 
side are at their true values. 

In our method we evaluate the likelihood of (10), to find the set of 
maximizing parameters. Disregarding expectations, Ut measures the fit 
of each equation, regardless of how the poorness of fit in one variable 
spills over into poorness of fit in the others; hence as a parameter varies, 
its effect on the fit of only the variables in the equations where it appears 
is maximized. One can think of this as like simultaneously running OLS 
on each equation if the errors are not cross-correlated. 

However, this discussion omits expectations y ^ which will reflect all 
the model parameters in principle. As a result poorness of model pro¬ 
jection will spill over into equations containing expectations — that is 
all of them. Hence every parameter’s effect on the fit of all equations 
via its effect on expectations will also be taken account of. Given the 
importance of expectations in the models we are dealing with here, this 
amounts to taking account of a high degree of simultaneity. 

We assume that the errors in either case follow a multivariate normal 
distribution with expectation of zero and variance-covariance matrix of 
E; and that estimation has eliminated any autocorrelation. This implies 
that the likelihood is: 


p(ui,U2, ...,u n ) = (27 t ) ” G / 2 (detE) "/ 2 exp 





t =i 



( 12 ) 


We evaluate this likelihood directly. That is to say, for each trial 
set of parameters n we obtain an implied set of iq for which (12) yields 
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the likelihood. This is maximized by the algorithm described above. In 
practice, replacing E by the estimated variance-covariance matrix of the 
errors, the first and last terms of this expression will be constants, so that 
the log likelihood becomes —n/2 In det E and hence we are minimizing 
the log of the determinant of the variance/covariance matrix of the u t - 
For example, where G = 2, we are minimizing ln(dy ay — oy 2 )- With 
disturbances uncorrelated across equations, this minimizes the log of the 
product of the sums of squared residuals or In a\ + In a\. 

Because there is no analytic method for computing the confidence 
limits in the effectively-small samples we have, we must use bootstrap¬ 
ping. The bootstrap procedure takes the u t generated by the maximum- 
likelihood parameter set, if. It then resamples these with replacement 
and inserts the sample with n into (10) to generate the bootstrap sam¬ 
ples of yt . In order to preserve the correlation across the elements of 
u t , we sample u t by t -th vector — that is, we draw the t vectors re- 
pearedly in different orders (t = 1,2,..., n).The Xt are held constant 
across bootstraps. Then the above estimation procedure is repeated on 
each bootstrap sample. 

FIML estimation as above thus locates the set of parameters that 
generate maximum likelihood, taking account of the expectations they 
generate as well as their fit given any set of expectations. To find the 
expectations requires model solution (convergence) in every period of 
the sample: this is a very stringent requirement and experience with 
the Liverpool model of the UK shows that it is frequently not met for a 
macroeconomic model, even of limited size (as with the Liverpool one), 
with arbitrarily chosen parameters. Hence the one available programme 
for estimating such models by FIML, that of Fair and Parke (1989), can 
easily break down if the initiating parameters do not permit convergence; 
the algorithm, finding non-convergence, may be unable to locate a pa¬ 
rameter set which does converge and the estimation does not get off the 
ground. 

For this reason, it appears necessary to begin with LI estimation and 
model simulation to locate a promising initial set of parameters. Move¬ 
ment towards maximum likelihood can then be achieved by iteration, as 
follows: (1) generate expectations (and equilibrium values) from the ini¬ 
tial set of coefficients and compute their likelihood; (2) holding expecta¬ 
tions and equilibrium values constant, run a standard FIML programme 
to convergence to obtain a new set of coefficients; (3) repeat steps (1) 
and (2) until the likelihood in step (1) is maximized (for more detail 
see Minford and Webb, 2001). Confidence limits can then be found by 
bootstrapping this FIML procedure. 

We do not yet know how much difference it makes, especially in 
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small samples, to use FI rather than LI estimation. Asymptotically 
(i.e. in large samples), standard errors should fall in moving from single 
equation to FIML estimation (West, 1986), for example, computes the 
asymptotic reduction in standard error for a small model as typically 
a rather small 20 per cent). Work on the Liverpool model in Minford 
and Webb (2001) indicates much bigger reductions for major structural 
parameters in preliminary work. 

When structural coefficient estimates have been obtained the REH 
restriction can then be tested. However, this has not been done at all 
widely. The reason is that for LI estimates the test has hardly any power: 
the less restricted expectations merely imply a different group of first- 
step regressors and the test involved is the weak one of chi-squared on 
choice of instruments. Most RE models have been estimated by LI. 

FIML estimates can test the restriction with considerable power. But 
so far such tests are not available. It is possible that, just as the over¬ 
identifying restrictions imposed by rational expectations force parameter 
values into a narrow range to achieve compatibility with all the data, so 
relaxing them by adopting backward-looking expectations may achieve 
a closer fit to the data. The question would then be posed: should one 
accept better fit at the expense of less demanding theory? Much would 
depend then on whether the loss of theory damaged the stability of rela¬ 
tionships. It is possible that models will give worse in-sample fit under 
RE than under backward-looking expectations but their consistency with 
theory may also yield better out-of-sample (forecasting) stability. 


VECTOR AUTOREGRESSION ESTIMATION OF 
MACRO MODELS 

An increasingly popular way of estimating models, following Sims’s (e.g. 
1992) rejection of orthodox structural models as having ‘incredible’ iden¬ 
tifying assumptions, is the VAR. In Sims’s and his disciples’ original 
methodology, the idea was to abandon normal identification altogether 
and simply estimate the unrestricted ‘joint final form’ of the endogenous 
variables of a model (in stationary form), treating the exogenous vari¬ 
ables as error processes to be combined with ‘structural’ errors. Each 
endogenous variable would then be a function of its own and other vari¬ 
ables’ lagged values, once one had substituted out for the current period 
interaction with other endogenous variables. ‘VAR identification’ would 
then consist of deciding on a recursive ordering of the endogenous vari¬ 
ables in the current period — for example, output might respond first, 
then prices. Then one can label the errors as ‘the output error’ and by 
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subtracting from the other error this error’s effect one can identify in 
the price residual the pure ‘price error’. 

To clarify matters, let us apply it to our simple example. If we take 
equations (8) and (9), difference them and substitute out for the exoge¬ 
nous processes we obtain the following equations in stationary variables 
and errors: 

oi(3 

A y t = aAp t - au t _i - — w t _i + f3w t + u s t - u s t _ x (13) 

9 

^ Q 

At /t = (<f> + i)v t -(</> + 7 )A p t - Ij-Wt + uf - uf_ ± (14) 

9 


Notice that the errors are in fact a mixture of structural and ex¬ 
ogenous process errors: call them respectively efand Gf. In order to 
create the VAR where moving-average processes are ruled out in favour 
of autoregression for convenience of estimation, we represent the error 
processes as autoregressive: 

p* £ d 

Multiplying through by lag operator terms and substituting out for 
the current interactions we obtain the joint final form as: 


A y t = diAt/t_i + <9 2 Ap t _i + rf t (16) 

Ap t = <9 3 Ay t _i + diApt-i + rf t (17) 


where 


di = {psl<t> + l] +ap d )/D 
d 2 = -a[^ + 7](p s - p d )/D 
ds = -{p s - Pd )/D 
di = {ap s - p d [<t> + 'i\)/D 
D = a + <j> + 7 

Vt = (e f s [0 + 7] +a &t)/D 

rft = {-et + efi/D 


It is plain that VAR identification is now difficult since the two errors 
are mixtures of the shocks to output and prices. 

One identification scheme that has been widely used was proposed 
by Blanchard and Quail (1989). This is to assume that only supply 
shocks (which we will here identify with €*) have a long-run effect on 
output; or in other words demand shocks have no long-run effect. To 
make the situation as favourable as possible, assume that both errors 
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are actually changes in (permanent) demand and supply shocks; thus 
G|= A S u &t = A D t where St, D t are permanent supply and demand 
shocks respectively. Then it is easy to see that the necessary restriction 
to prevent D t affecting output in the long run is that a = 0. In this case 
the error in the output equation of the VAR, (15), is identifiable as 
the innovation in the supply shock. One can then attempt to disentangle 
the effect of this shock (using correlation analysis or some prior estimate 
of (f> + 7 ) from the error in the price equation and so identify the demand 
shock. 

Clearly the Blanchard-Quah assumption is entirely justified (and of 
course emerges from our simple model). The problem lies in applying 
it in practice to the mongrel errors we have in this model. Our ‘supply 
shock’ composite is as (13) reveals not really the change in a permanent 
supply shock at all, but a mixture of that (A uf) with innovations, current 
and lagged, in money and productivity. It is for this reason that when 
we apply the constraint in practice, we get the obviously wrong result 
here that a = 0 , when of course plainly it is non-zero; the reason is that 
our error includes the lagged effect of errors on expected prices. While 
in more complex models it may be possible to make a better case for the 
practical method of applying such an identifying restriction, this simple 
example shows the potential pitfalls. 

It has by now been widely recognized that somehow VAR models 
must be constrained by restrictions of the structural model so that sen¬ 
sible identification can be achieved and yet sufficient freedom be allowed 
for the joint final form to fit the data flexibly. This half-way house has 
for example taken the form of imposing the long-run restrictions of all 
the theory (through a set of cointegrating relationships) but allowing 
the short-run dynamics to be entirely unconstrained (Motto and Wick- 
ens, 2000). This takes somewhat further the Blanchard-Quah approach. 
This of course still leaves short-run impacts unidentified except by arbi¬ 
trary assumptions about the recursive ordering of variables. 

The dilemma for VAR proponents is that the more structure is im¬ 
posed the less the data is incorporated flexibly in the estimated coeffi¬ 
cients; while identification requires some theoretical structure, which is 
hard to impose in the flexible joint final form of the VAR. The resulting 
estimates of the VAR are hard to interpret, apart from being routinely 
vulnerable to the Lucas critique. 
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CONCLUSIONS 

The rational expectations hypothesis is a modelling assumption. It does 
not assert that everyone everywhere always have rational expectations 
any more than the marginal utility hypothesis asserts that all consumers 
everywhere maximize utility at all times. Therefore tests of the hypoth¬ 
esis through surveys are, though interesting, not persuasive, since they 
check what certain groups (say they) think rather than what their and 
others’ behaviour is in the market. A modelling assumption can only 
ultimately be tested, as Friedman (1953) has stressed, by testing the 
goodness of lit of the model, with and without it. Such a test is still 
not widely practised as it requires full systems estimation of a macro 
model with all the RE restrictions imposed and this has proved difficult 
in practice; we discuss some recent work implementing a new algorithm 
and bootstrap method for this purpose. VAR analysis is widely used 
to describe the lead-lag relationships between endogenous variables and 
the shocks driving them; but identification is a serious problem for inter¬ 
preting these models as evidence for any particular expectations process 
or indeed for structural coefficients generally. We conclude with the sug¬ 
gestion that the best hope for estimating and testing a modern macro 
model and rational expectations is to use full systems methods along the 
lines sketched out here. 



17 

Where are we going? 


This book has reviewed a huge body of work. In this brief last chapter 
we look at only one issue but the central one of this subject: how are 
macroeconomists today tackling the modelling of the economy, how does 
it compare with what went before and in what directions is it leading? 

When the first edition of this book was written twenty years ago mod¬ 
els of the economy contained backward-looking expectations and were 
large systems of demand and supply equations of the IS/LM /Phillips 
curve type. We reviewed them in Chapter 1 of this edition. We argued 
then that they were misspecified, mainly because of the lack of rational 
expectations. We were, however, quite doubtful of how much benefit 
would be derived from jettisoning the demand/supply curve set-up as 
well as the expectations assumption; the Lucas Critique suggested these 
curves could shift under relevant regime change but the evidence was 
mixed on the seriousness of such shifts for practical policy evaluation. 
But we agreed that the representative-agent and real-business-cycle re¬ 
search programme was exciting and could yield eventual results. 

Two decades on, two contrasting things have happened. On the one 
hand representative-agent models have been developed in great numbers 
with a wide variety of assumptions about constraints (on cash and liq¬ 
uidity needs, notably), utility functions and investment technology. On 
the other hand we have seen the return of models with nominal rigidities 
based on Phillips curves with overlapping wage contracts, completed by 
an IS curve and an interest-rate setting rule for monetary policy — sim¬ 
ilar in many respects to the neo-Keynesian-synthesis models of Chapter 
1. This is a paradox indeed. Adherents of the latter (Clarida et al., 
1999) have proclaimed that these models now dominate the field as ve¬ 
hicles for the ‘new science of monetary policy’. Proponents of the former 
are quietly contemptuous of such claims, since such models are nothing if 
not rough and ready aggregations of decision rules which will shift with 
regime change; policy evaluation could not credibly be based on them. 
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To which the response of these latter-day neo-Keynesians has been 
that their models do ‘lit the facts’ in recognisable traditional ways: 
B 2 , predictive performance and so on. In terms of fit the representative- 
agent models have proffered the mimicing of unconditional moments 
in the data by the models — as opposed to the conditional moments 
stressed in the traditional methods. This has been fairly unconvincing 
to the bulk of the profession; policy makers have generally ignored these 
models as a result. 

There has been another problem with these models. Their complex¬ 
ity is by now overwhelming — in a way that used to be levelled as a 
severe criticism at the large neo-Keynesian models of Chapter 1 (‘too 
many equations for anyone to understand the reasons for what is going 
on’). These models are extremely large and nonlinear, posing problems 
of solution and possible multiplicity of equilibria that have not been tack¬ 
led; virtually all practitioners ignore them, find a steady state, linearize 
around it, and then use heavy-duty mathematics to establish theorems 
about how the economy will behave over the business cycle. Yet these 
claims lack credibility when the models have not themselves been solved 
and the existence and stability of the possible equilibria have not been 
established. 

It is not surprising perhaps in these circumstances that central bank¬ 
ers and Treasury officials have turned gratefully back to the new-look ISj 
LM /Phillips curve models they were brought up on and have now been 
told it is respectable again to use. Yet when one considers this as a basis 
for serious monetary policy regime choice, one cannot but be dubious. 
We have spent much time in this book examining why such methods are 
vulnerable. Monetary regimes underpin the length of nominal contracts 
themselves, and their degree of indexation; we pointed out in Chapter 6 
that the issue of whether central banks should target the level of prices 
or the money supply may well turn precisely on these features of the 
environment. 

Meanwhile, more or less orthogonally to this modelling aimed at pol¬ 
icy regime analysis, we have burgeoning statistical modelling centred on 
vector autoregression (VAR) where, as we argued in the last chapter, 
there is restricted scope for convincing identification of structural coeffi¬ 
cients. Essentially therefore this is a black-box forecasting methodology 
with limited use in policy analysis. Even as a description of ‘the stylized 
facts’ of the effects of shocks such as a monetary contraction on output 
and prices, it is unconvincing; the reason being that identification of the 
shocks in these equations is imposed from some scheme whose choice is 
a subjective matter for the researcher. 

What this all adds up to is that while macroeconomics has been an 
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exciting subject to be in during the past twenty years, it has failed yet to 
converge on any consensus of how to model economies or how to evaluate 
policy. Yet in spite of this there has been an extraordinary convergence 
in the area of monetary policy among both macroeconomists and policy 
makers: on the proposition that monetary policy’s key role is to control 
prices, regardless of the short-term unpleasantness of the task. This was 
most decidely not the case twenty years ago; then Keynesians asserted 
in the teeth of monetarist fury, that ‘core’ inflation was too costly to 
bring down by monetary and supportive fiscal policies and that instead 
incomes and price controls should be used. Such views today would be 
unthinkable. 

This suggests in the end that we have seen substantial progress in 
macroeconomic thinking on the crucial issue of how inflation is deter¬ 
mined; here the rational expectations revolution was the key factor with 
its emphasis on the potential speed with which an inflation equilibrium 
could change in the face of a serious change of policy. We still have much 
work to do on how the business cycle is determined and how that in turn 
interacts with monetary policy. 



Time-series annex: an introduction 
to linear and nonlinear time series 


The purpose in this Annex is to provide an introduction to some of 
the relatively recent developments in time series which are becomingly 
standard in empirical analysis in macroeconomics and finance. We will 
not be concerned with statistical tests in this chapter. The interested 
reader can consult the references provided for these tests and a more 
detailed and formal analysis of the material we cover. 


LINEAR TIME-SERIES MODELS 

ARMA Processes 

In chapter 2 we introduced the Wold decomposition as a method - 
the Muth method — of solving linear rational expectations models. We 
found that all variables in a linear model can be described by such a 
decomposition. For a variable y t it is given by 

OO 

yt = y + ^2PiU t -i (i) 

2—0 

where y is a deterministic component assumed at this stage to be a 
constant and /3 i are constants; the u t - l are random variables which have 
zero expected value and constant variance and are assumed at this stage 
to be normally distributed and independent of all past information. Such 
an error process is called Gaussian white noise.. 

It is assumed that y t is a covariance-stationary process so that its 
properties are unaffected by a change of starting date. This results in the 
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mean, variance and the (auto) covariances of the series being constant. 
Thus for y t to be stationary for all values of t, 

E(Vt) = V ( 2 ) 

var {y t ) = E(y t - y ) 2 = a 2 y < oo (3) 

co v(y t yt +s ) = cov(y t y t - s ) = E(y t - y)(y t+s -y) = 

E{yt-y){yt-s-y)=is ( 4 ) 

where in (2) y is the constant mean, in (3) a 2 is the constant variance 
and in (4) 7 S is the constant s-order covariance. Notice in particular 
that the covariance between y t and y t + s depends only on s, the lead or 
lag between the two y values, and it is a measure of whether they move 
together. Also, the covariance for lag zero is the variance, 7 0 . Naturally 
a series may have to be transformed by extracting a trend or differencing 
to induce stationarity and we shall see below the transformation required 
to induce stationarity has important implications. 

Equation (1) is known as a moving-average (MA) error process. The 
order of a moving-average process is given by the maximum lagged error 
term so that y t =2 + u t + — 0.01u t _5o is a moving-average error 

process of order fifty, MA(50). 

We can also write a linear process in terms of its own past values. 
For a variable Xt we have 

OO 

x t =x + OLiXt-i + e t (5) 

i =1 

where x is a deterministic component, i.e. contains no random elements, 
such as a constant and or trend, a, are constants and e* is a random 
variable. 

Equation (5) is known as an autoregressive process (AR process). 
The order of an autoregressive process is given by the maximum lagged 
x term so that x t = 0.95a7_i + 0.01xt_2 +et is an autoregressive process 
of order two, AR(2). 

Consider the MA(1) process 

y t =y + u t + Piiit- i (6) 

where \ f3 1 \< l 

Employing the lag operator, L, we can write (6) as 
yt = V + (1 + Pi L)u t or = Ut 


( 7 ) 
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Expanding the polynomial in the lag operator, 1+ \ L = l + (—/3 1 L) + 
P\L 1 2 + (—/?i) L 3 + ... we find that 

Vt = — fiiyt-' 2 + PiVt-3 + ••• + «( (8) 

So we observe that the MA(1) process can be written as an AR(oo). 
More generally, assuming the MA polynomial can be inverted, any MA 
process can be rewritten as an infinite AR process. 

Consider next the AR(1) process x t = x + aiXt-i+et- Employing the 
lag operator and the same procedure as above we can write the AR(1) 
process as an MA(oo) 

Xt = -— -h £t + Qpe t _l + + afe t ~3 + ... (9) 

1 — a\ 

For empirical purposes we can model a series as a finite mixture of 
autoregressive and moving average components. Such a process is known 
as an ARMA process and for a series y t is written as 


Ut — yo+<^i2/t-i+Q!2yt-2+Q!3yt_3+.. .+a p yt- p +Ut+PiUt-i+(32 u t-2+ 

Pz u t-z + • • • + /3 q u t - q (10) 

where yo is a constant. 

Equation(lO) has p autoregressive terms and q moving average terms 
and is known as an ARMA(p, q). 

Consequently the process y t = 2 + 0.5y t _i + u t + 0.1u t -i — 0.01u t „2 
is an ARMA(1, 2). This process can also be rewritten as either an 
ARMA(oo, 0) or as an ARMA(0, oo). 

Any variable in a linear model has an ARMA(y, q) representation 12 . 
Naturally the ARMA representations of a variable will in general change 
if the policy regime changes, hence ARMA representations of endogenous 


1 For instance in the model (all standard notation) 
m t = pt + yt 

m t = m + ppt-l + ut 
y t = y + a(p t - E t -rPt) + 

Assuming rational expectations we can obtain ARMA reduced forms for each of the 
variables where yt is an ARMA(1,0), pt is an ARMA(2,1) and mt is an ARMA(2,2). 
We leave this as an exercise for the reader. 

2 We should note that care has to be taken empirically not to over-parameterize a 

process. For instance the process yt = 0.75t/t-i— 0-lyt-2+ut— 0.75rtt_i+0.1rtt_2 can 
be simplified to yt = ut (since 1 — 0.75L + 0.1L 2 is a common polynomial for the AR 
and MA processes.) Simple macro models typically imply ARMA representations for 
variables which are much more complicated than parsimoniously estimated processes. 
The fact that roots cancel, or near cancel, is probably the reason. 
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variables are naturally subject to the Lucas critique. Given this point 
however ARMA processes can be readily estimated using many statis¬ 
tical packages and are often employed to model exogenous variables in 
macroeconomic models as well as representing the ‘naive’ forecast in fore¬ 
cast comparisons.They have also been suggested as proxies for modelling 
expectations, e.g. Feige and Pierce (1976). 

We defined in (4) above the (auto)covariance function. It is fre¬ 
quently more convenient to employ autocorrelations rather than covari¬ 
ances since autocorrelations vary between —1 and 1. They are simply 
the correlations between y t and yt+ s , and are defined by 

Ps = ~ ( 11 ) 

7o 

As s is varied the values of p s trace out the autocorrelation function 
(ACF). For a white noise series, u t , since each value is uncorrelated 
with every other value, all the autocorrelations E(u t u t + S ) are zero for 
all s 0, and as the mean and variance are constants, the series is 
stationary. 

More generally, in the regression framework the autocorrelations are 
given by the coefficient in the regression 

Vt = PsVt-s + e t (12) 

where e t is the error process. 

Another important property of a series is its partial autocorrelation 
function (PACF). This is possibly best explained in the context of re¬ 
gression. Consider the regression of y t on yt-\ and y t -2- 

Vt — tpuVt-l + 4 > 22Vt—2 + et (13) 

where the symbol e t is used for convenience again to denote the error 
process though it differs from that in (12). The first subscript on cp 
indicates the lag of the variable concerned (1 for y t _ 1) and the second the 
maximum order of the regression (2 in this example). The coefficient on 
yt-2 is the partial autocorrelation coefficient since it indicates the partial 
or extra effect of adding y t -2 to the equation when y t - 1 is already there. 
In the case of (13) it is the partial autocorrelation coefficient (PAG) of 
order 2. The first order PAC is given by 

Vt = (piiVt-i + e t (14) 

and from comparison with (12) for s = 1 we observe that (p n = p 1 . More 
generally, the PAC of order p is <p pp in 

yt — 4 >i P yt-i + (p2 P yt-2 +... + (p pp y t - p + e t 


(15) 
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The properties of the ACF and PACF are important guides to deter¬ 
mining the generation process for a series. If a variable is stationary a 
sample of data for y can be regarded as a sample of n observations from 
the underlying probability distribution of y. This means that we can use 
the data to estimate these parameters.The estimates based on a sample 
of n observations yi, y%, • • •, y n are 


* Y, y 

y= — 

n 


(16) 




^ 2= E (yt- y) 


n — 1 


(17) 


E (yt-y){(yt+ s - y) / lg x 

s n — s 

Theoretical Autocorrelations and Partial Autocorrelations 

Consider the MA(1) process 

y t =y + u t + Pu t - 1 (19) 

where Ut is a random variable defined as in (1). The expectation of yt. is 
given by 

E(y t ) = E(y + ut + Put- 1) = y + E(u t ) + PE[v,t— i) = y (20) 
The variance of yt is 

E{y t — y) 2 = E(ut + Piit-i) 2 = E(u 2 + 2Pu t ut-i + P 2 u 2 -\) 

= a 2 + 0 + p 2 a 2 = (1 + p 2 )a 2 (21) 

where cr 2 is E(ut) 2 , the variance of u t . The first autocovariance is 

E{yt - y){yt -1 -y) = E(u t + Pu t -i)(u t -i + pu t - 2) 

— E(utUt— 1 + pUt/Ut- 2 + Pu 2 _ 1 + p 2 Ut-iUt— 2 ) — P IJ ~ (22) 
Higher autocovariances are all zero: 

E(y t - y){yt-s -y)= E(u t + pu t -i)(u t - s + Puts- 1) = 0 (s > 1) 

(23) 
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From the definition of the autocorrelation (11) 

= 7i = P° 2 = P 

Pl 7o (1+/?V (1 + P 2 ) 

Higher autocorrelations are all zero. 

For the gth order MA(q) process 

Ut=y + U t + PlIH-l + P2 U t-2 + • • ■ + P q U t -q 
The mean of (25) is given by 

Ey t = y + E(v,t ) + PiE(u t - 1 ) + P 2 E{ut~ 2 ) 4— • • • + P q E(ut~ q ) = y 

(26) 

The variance of the MA(q) process is given by 

7o = E (yt - yf = E(u t + PlUt -1 + P 2 U t -2 + • ■ • + PqUt-q ) 

{lit + Pi'Ut-l + P 2 Ut~ 2 + • ■ • + PqUt-q) (27) 
since the us are uncorrelated so that E(u t u t ~ s ) = 0 for s > 1 

7 0 = E(y t - yf = a 2 + p\a 2 + p\a 2 + ... + 0 2 q a 2 = 

(l+Pl+Pl + .-.+pDo 2 (28) 

The autocovariances are given by 

7 S = E{y t - y)(yt-s -y) = E(u t + Phh-i + P 2 ut -2 + ■ ■ ■ + P q u t - q ) 

(Ut.-s + P\Ut- s -l + p2 u t-s-2 + • • • + P q Ut-s-q) (29) 
The cross terms all have expectation zero so (29) gives 

7s = E(y t - y){y t -s -y) = E{P s u 2 _ s + P s+1 Piu 2 _ s 

+ Ps+2P2 U t-s-2 + • • • + P q P q -s U t~q) ( 30 ) 

so that 

7s = {Ps + Ps+iPi + PS+ 2 P 2 + ■ ■ ■ + PqPq-sW for s = 1, 2 ,q (31) 
7 S = 0 for s > q 

For example, consider the MA(3) process, yt = y + u t + P\U t -1 + 

P 2 u t -2 + P 3 u t -3■ In it: 


(24) 


(25) 


— (1 + Pl + 02 + ^ 3 )°"“ 

= (1 + Pi + P 2 P\ + PsPfu 2 


To 

7i 
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so that 


72 = (1 +P 2 +/3 3 PiW 2 

7 3 = (As) 0 ' 2 

74 = 75 = • • • = 0 


Pi 

P-2 

P3 

Pi 


71 _ (1 + /?! + /?201 + P 3 P 2 ) 
7o (1 + 0\ + 02 + 0i) 

72 _ (1 + /?2 + /^3^l) 

7o (1 + Pi + /?! + /? 2 ) 

73 = (# 3 ) 

7o (1 + Pi + P 2 + $ 5 ) 

P 5 = ■ ■ ■ = 0 


Autoregressive Processes 


Consider the AR(1) process 

Ut =Vo + aiyt-i + u t 

where Ut is a random white noise error process and | cti |< 1. 
Writing (32) as an infinite MA process we have 

I/O 9 o 

Ut = z -h u t + ai'Ut-i + o.]Ut —2 + 07^—3 + • • • 

l — o 

Consequently the mean of (33) is 


(32) 


(33) 


E(y t ) = E 


Vo 2 q 

+ Ut + QLlUt-1 + (X{Ut—2 + Q-iUt —3 + • • ■ I = 


1 — a\ 


y 0 


= y (34) 


1 — Ol 

The variance is given by 

7o = E (y t - xj) 2 = E(u t + aiut-i + a\u t -2 + a?u t _ 3 + ... ) 2 (35) 

so that 

7o = E(y t - y) 2 = E(1 + a\ + af + af + ... ) (36) 

therefore since 1 _ 1 m = 1 + u\ + af + af + ... 


7o = E (y t - y) 2 = - - 2 

1 — ol\ 


(37) 
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(37) illustrates why the condition | ol\ |< 1 is required for stationarity. 
The sth autocovariance is given by 


7 S = E (yt -y){yts-y) = 

E(Ut + OL\Ut-l + OL\Ut ~2 + ■ • • + 0/i‘Ut-s + Qq +1 Ui_s_i . . . ) 

(' Ut-s + OL\U t -s -1 + Ol\ut-g—2 + &lUts-3 + • ■ • ) (38) 

so that 


7s = E (yt - y){yt-a - y) = (a! + a s i +2 + al +4 + ...)a 2 = 

a s [l + a\ + a\ + ...) (39) 

therefore 

7s = E{y t - y)(y t - s -y) = 

As a consequence the s-autocorrelation is given from (39) and (37) 
by 


Ps = 


7s 

7o 


— Qt .-1 


(40) 


Consequently the autocorrelation function follows a pattern of geo¬ 
metric decay. 

Rather than employing the infinite moving-average representation 
to derive these properties we could have assumed that the process was 
stationary and proceeded directly as follows. 

Taking expectations of (32) 


E(y t ) =yo + a 1 E(y t -i ) + E(u t ) (41) 


Letting E(y t ) = y = E(y t _ i) we have 

V = Vo + ary + 0 (42) 


or 


y = 


y o 

1 — CKl 


(43) 


Similarly the variance is obtained by substituting (42) into (32) to 
obtain 


yt = 2/(1 - ai) + ary t -i + u t 


(44) 
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or 


y t -y = «i {yt-i ~y) + u t (45) 

If we square both sides of (45) and take expectations: 

E(yt - y) 2 = ot\E(y t - 1 - y) 2 + 2a 1 E((y t -x - y)u t ) + Eu\ (46) 

Since u t is uncorrelated with y t -1 (which is a summation of shocks 
at t — 1 and before from (33)), we obtain from (46) that 

E(y t - yf = Oi\E{y t _i - y) 2 + a 2 (47) 

The assumption of stationarity implies that 

E(y t ~ y? = E(y t _ i - y) 2 = 7 0 (48) 

so that the variance of y t is given by 

2 

7o = i?(y t -y) 2 = r ^ (49) 

Also multiplying (45) by y t - s — y and taking expectations we can 
obtain the autocovariance function 

7 S = E{y t - y){y t ~s -y) = cn\E{y t _i - y){y t -s ~y) + E(u t )(y t - S - y) 

(50) 

The last term on the right-hand side of (50) has zero expectation and 
using the fact that t — s = t — 1 — (s — 1) implies that the first term on 
the right-hand side of (50) is E(y t ~i — y){yt- l-(s-i) ~ V) which is the 
s — 1 autocovariance of y t . Consequently (50) simplifies to 

7 S = «i7s-i ( 51 ) 

This difference equation has the solution 

7s = a i7o ( 52 ) 

(52) is the solution (40). 

We next consider the AR(2) process 

Vt = 2/o + a iVt-i + ot 2 yt .-2 + Ut. (53) 

Using the procedures above, so that we once again assume station¬ 
arity, 

E(y t ) = yo+ cnE(y t -i) + a 2 E{y t - 2 ) + E(u t ) (54) 

and 


E(y t ) = E(y t -i) = E(y t - 2 ) = y 


(55) 
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so that 


E(y t ) 


1 


J/o_ 

Qq ~ Oi 2 


= y 


Rewriting (53) using (56) we obtain 


(56) 


y t -y = oci{y t -i -y) + a 2 {y t - 2 -y)+u t (57) 

Following the procedure for the AR(1) case we multiply (57) by y t - s — 
y and take expectations to obtain 

7 S = «i7 s -i +«27 s -2 (58) 


Consequently we observe that the autocovariances follow the same 
process as the the autoregressive process. Restrictions on oq and a 2 are 
required to ensure stationarity. 

The autocorrelations are obtained by dividing (58) by j 0 . We obtain 
p s = a\p s _i + a 2 p s _ 2 (59) 

For s = 1 we obtain 

Pi = oqp 0 + a 2 p_i (60) 


p_i, the autocorrelation one period ahead, is the same as p 1 by the 
assumption of stationarity and p 0 = 1 (the autocorrelation of a variable 
with itself), hence p x = aq p 0 + a 2 p x so that 


For s = 2, 


Pi = 


ai 

1 — a 2 


(61) 


P2 — a lPl + a 2 


(62) 


To calculate the variance we multiply (57) through by y t — y and take 
expectations to obtain 


E{y t - yf = aiE(y t - y)(y t -1 -y) + a 2 E(y t - y)(y t -2 - y) + 

E(y t -y){u t ) (63) 

so that 


7o = «i7i + «272 + ^ (64) 

where 0 2 = E{y t — y)(u t ) since y t = u t + linear terms in lagged ys which 
are uncorrelated with u t . 
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We write (64) using definitions of p x and p 2 in terms of 7 0 , as 

To = OiPiTo + U 2 P 2 I 0 + v 2 (65) 

Substituting from (61) and (62) into (65) we obtain 

To = «1 ^ °^ a ^ j To + «2 (ar -- + a^j 7 0 + u 2 (66) 
which simplifies to 

7 0 (1 — 02 - of a 2 a\ - a\ + a\) = (1 - 2 

which can be rewritten as 


To = E (Vt - V) 2 


<r 2 (l - a 2 ) 

(1 + a 2 )[(l - a 2 ) 2 - a 2 ] 


(67) 


Pth Order Autoregressive Process and General ARMA Pro¬ 
cesses 


The mean, variance and autocovariance function for the ptli order AR(p) 
process can be obtained in a similar manner to the above though the 
mathematics is more difficult — see Hamilton (1994). 

The key property is that the autocorrelations and autocovariances 
follow the same process as the AR(p) process, known as the Yule-Walker 
equations. Consequently the autocorrelations are given by 


Ps — a lPs- 1 + a '2Ps-2 + a 3Ps-3 + • • • a pPs-p (68) 

The solution of this difference equation will give p roots if they are 
distinct. Consequently the autocorrelations can display a variety of pat¬ 
terns, dependent upon the signs of the roots and whether they are com¬ 
plex, but they will decay over time. 

For the ARMA(p, q ) process the autocorrelations will be given by 
the Yule-Walker conditions, i.e the pattern of coefficients governing the 
AR(p) component but only for s>q. Up to this point moving-average 
terms will induce more complicated patterns in the autocorrelations than 
the pure AR(p) process. 

Finally in this section we illustrate diagrammatically the time-series 
behaviour of a few ARMA processes. In figures 18.1a-f we plot the re¬ 
alizations of a variety of processes. The random error term is drawn 
from a random number generator that has a normal distribution and is 
Gaussian white noise (see footnote 3 for the definition). The standard 
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deviation of the error is 1. We set the initial values of yo, yi at zero and 
simulated 1000 observations of each process. The realizations from ob¬ 
servations 600-1000 are plotted in the figures. The ACF and PACF are 
also reported for the first ten lags of the processes. If the sample size (n) 
is large, the estimated individual autocorrelations and partial autocorre¬ 
lations have a standard error of approximately -^= and their significance 

can be tested using a normal distribution (-^= is approximately 0.05 for 
our data sets). In figure 18.1a we plot the random variable. Each of 
the autocorrelations is not significantly different from zero as would be 
expected from theoretical considerations. Figure 18.1b plots an MA(1) 
process with coefficient —0.6 which appears somewhat more spiked than 
the white noise process.The ACF is significant at order 1, consistently 
with the theoretical result above. The PACF is significant at orders 1 
and 2. If we recall that the MA(1) can be written as an AR(oo), the 
insignificance of coefficients in the PACF after order 2 informs us the 
series can be well approximated by an AR(2). 

Figure 18.1c is a plot of an AR(1) process with coefficient 0.9. The 
plot exhibits an outcome that looks fairly cyclical. The ACF declines 
relatively slowly and the PACF has a significant coefficient at order one; 
the remaining coefficients are insignificant. As the AR(1) coefficient 
declines the waves become more choppy. This is illustrated in figure 
18.Id with an AR coefficient of 0.5. For this case the ACF does not 
decline geometrically as theoretically predicted. This is the result of 
sampling error. The PACF exhibits one significant coefficient at order 
1. In figures 18.le and If we plot an ARMA(1, 1), with coefficients 
AR(0.9) MA(—0.25); and an AR(2), with coefficients 1.3 and —0.4, which 
are close to those found for some aggregate macro time series. The 
series exhibit similar cyclical behaviour though the amplitude is different. 
For the ARMA(1, 1) case the ACF declines slowly and the PACF is 
significant at order 1 and 2. For the AR(2) the ACF and PACF exhibit 
similar qualitative features. 

In practice we look at the ACF and PACF which may be suggestive 
of the type of process to estimate. More generally we estimate models 
with smallish values of p and q and use diagnostic measures to choose 
between the models and to see if the model is in need of modification 
(see e.g. Hamilton (1994), Franses and Van Dijk (2000), Mills (1993). 


NONLINEAR TIME SERIES MODELS: 


Structural economic models are typically a mixture of linear and nonlin¬ 
ear structural equations. Budget constraints for instance are nonlinear 
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Figure 18.1a Random Variable 
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Figure 18.1b MA(1) process 
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Figure 18.1: Pth order autoregressive processes 
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Figure 18.1c AR(1) process, p = 0.9 


ACF PACF 
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Figure 18.Id AR(1) process, p = 0.5 ® 
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Figure 18.1: Pth order autoregressive processes (continued) 
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Figure 18.le ARMA(1,1) 
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Figure 18.If AR(2) 
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Figure 18.1: Pth order autoregressive processes (continued) 
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(in that they include the products of variables). In addition where fixed 
costs of adjustment are incurred agents may respond to shocks that drive 
them from equilibrium in a nonlinear manner, that is do nothing until it 
is worthwhile incurring the fixed cost. It would perhaps be surprising if 
the time-series behaviour of macroeconomic variables could, in general, 
be well approximated by ARMA processes. In recent years there has 
been a great deal of empirical work on this issue. 

An infinite number of nonlinear models can be proposed. In order to 
make progress, various simplifying assumptions have to be made. Anal¬ 
ogous to the ARMA(p, q) representation of a variable, we can assume 
that a variable can be explained in terms of a nonlinear function of finite 
number of past values of realizations of outcomes and shocks. 

Vt = h(y t -i,y t - 2 ,yt- 3 , -,yt-pUt-i,u t - 2 , -,u t - q ) + u t (69) 

where we assume again the Ut-i are independent zero-mean random 
variables. 

Priestley (1988) expands the right-hand side of (69) as a Taylor se¬ 
ries expansion to order one about an arbitrary but fixed time point. 
By further restrictions on this expansion, he obtains various ‘classes’ of 
nonlinear models.We consider the most frequently employed nonlinear 
models obtained from this procedure. 

The Bilinear Model 

Bilinear models were introduced into the statistical literature by Granger 
and Andersen (1978) and Subba-Rao (1981). The bilinear model is given 
by 


p q m k 

2/t = X a iyt-i + X Pi U t-i + EE Oijyt-i.ut-j + Ut (70) 

2=1 2=1 2=1 j = 1 

where the cq, j3 i and 0 t j are constant. 

We observe that the bilinear model has a linear ARMA component 
and is supplemented by terms which represent interactions between past 
noise and past ys, such as y t -iUt- 2 , yt- 3 Ut-i ■ 

A simple case of a bilinear model is the first-order model 


Vt =yo + aiyt-i +u t + duyt-xiit-i (71) 

In this model is i/ t is stationary if af + Q\ x < 1. The mean of the 
process is given by 
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Ey t 


yo + 9 no- 2 
1 — a\ 


(72) 


where a 2 is the variance of u. Expressions for the variance and covari¬ 
ances are more complex to work out — see Priestley(1988). 

Brockett (1976) demonstrates that the bilinear model can approx¬ 
imate a wide class of nonlinear relatioships to an arbitrary degree of 
accuracy over a finite time horizon. The bilinear model does not how¬ 
ever capture certain types of cyclical behaviour such as limit cycles (see 
below). It does capture series which exhibit ‘bursts’ of large amplitude 
or spiking behaviour. 

Some realizations of bilinear processes are shown in figures 18.2a-c. 
In each case u is normal with a mean of 0 and a standard deviation of 1. 
Initial values of lagged y were set at zero and the first 600 observations 
discarded. The ACF and PACF are reported to illustrate the (mislead¬ 
ing) ‘linear properties’ of the series. Figure 18.2b taken from Priestley 
(1988) is particularly graphic, illustrating a ‘burst of large amplitude’. 
y does not have constant values in the figure but the compression makes 
it appear so. The other figures illustrate the spiking effect that bilin¬ 
ear processes can produce. Bilinear processes have not been employed 
very much in economics. This is perhaps because the model is not obvi¬ 
ously derivable from an economic model. Maravall (1983) and Peel and 
Speight (1998)) provide examples of its use in modelling economic data. 


Threshold and Smooth Transition Threshold models 


As demonstrated by a number of authors — e.g. Dumas (1992), Ca¬ 
ballero and Engel (1992) — when agents face fixed adjustment or trans¬ 
actions costs adjustment to equilibrium will not be linear. Rather there 
will be a ‘band of inertia’ followed by action which will depend on the 
precise specification of adjustment or transactions costs. More generally 
we can think of variables behaving differently, so that some of their prop¬ 
erties, such as mean and variance, differ in different regimes or states of 
the world, like slumps or booms. 

One form which can be obtained from the Taylor approximation of 
(69) is a generalization of the linear AR model given by 

Vt = Vo( s t ) + ot\{st)yt-\ + OL2{s t )yt-2 + • • • + a p {st)yt~ p + u t ( 73 ) 

where yo(st), au(st), & 2 (s t ), *», ct p (st) are functions of states or regimes. 

Threshold models proposed by Tong (1990)) and their extensions, 
the smooth-transition threshold models, are one important class of such 
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Figure 18.2a Realization of bilinear process 6 
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Figure 18.2: Bilinear model 
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Figure 18.2c Realization of bilinear process 
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Figure 18.2: Bilinear model (continued) 


models which have been extensively employed in economics and finance. 
In these models the state s is modelled as a function of observed vari¬ 
able^), say x t . The Threshold Autoregressive (TAR) model of Tong 
assumes that the state is determined by the value of x t relative to a 
constant threshold value, A. When the x t is a previous value of y. so 
that x t = yt-d , where d is an integer greater than zero, the process was 
called by Tong a Self-Exciting TAR (SETAR) model. 

One example of such a model is the three-regime SETAR model 

Vt =V o + aoiyt-i + u ot if Vt -i < -A 
yt=y i + auyt -1 + uit if - A < yt -1 < A (74) 

yt = y 2 + oinyt-i + u 2t if y t -i > A 

where the u are assumed at this stage to be Gaussian random variables. 

If we interpret A as a transaction cost and set the constants y 0 . y 1 , y 2 
to zero, and «oi = <*21 = 0, an = 1, uot = u 2 t■ We would obtain a TAR 
model which might represent the behaviour of an arbitraged variable. 
For instance suppose y t was the deviation from covered interest arbitrage. 
We might suppose that within the transactions band the deviation is a 
random walk so that yt = yt -1 + u\t- Outside the transactions band 
arbitrage would make y* a random variable, possibly with very low error 
variance. 
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Another example is the two-regime model 

Ut =y 0 + OtOlVt-l + Ot02Vt-2 + Ot03Vt-3 + u ot if Dt- 2 < A 


Vt=V i + Oiiiyt-i + u 2t if Vt -2 > A (75) 

TAR models can exhibit many interesting properties. If we ‘switch 
the noise off’, i.e. put the random terms to zero and simulate the de¬ 
terministic component of the model (called the skeleton by Tong), the 
resulting nonlinear difference equations can generate cyclical behaviour, 
including asymmetric cycles, that is cycles of different length, as well as 
chaos (see below). This is also a feature shared by an extension of the 
threshold models known as smooth transition models. 

A smooth transition model has the form 

aiVt-i + (1 — Ft) 

i=1 

where ct; and <5,: are constants, Ut is assumed at this time to be Gaussian 
and F t is the continuous transition function, which is usually specified 
to be bounded between zero and unity. 

Ozaki (1978) and Haggan and Ozaki (1981) introduced the exponen¬ 
tial autoregressive model known more widely these days as the Expo¬ 
nential STAR (ESTAR) model. 

ESTAR Model 

In the ESTAR model 

F t = e-^ Vt - d ~ X)2 (77) 

where d is the delay,. 7 is a positive constant and A is a constant. 

The transition function for the ESTAR model is symmetric in devi¬ 
ations of y t ~d from A. We observe from (76) that as F t varies between 
zero and one we obtain an infinite number of different autoregressive 
processes each corresponding to a different state. 

An interesting special case of the ESTAR which illustrates some of 
the possibilities is the model 

y t = y t - ie _7(y '- l)2 + u t (78) 

When the deviation y t -\ is large (78) will give approximately y t = u t . 
Conversely when y t - 1 is small (78) will give approximately yt = yt-i+u t 
or yt —yt-i — Ut so that y exhibits behaviour which varies from where y is 
approximately random to where changes in y are approximately random. 



i =1 


kyt- 


IH 


(76) 
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The deterministic component of (78) is plotted in Figure 18.4(a) with 
yt—yt- 1 on the vertical axis and yt-i on the horizontal axis. We observe 
the symmetric nature of the plot and the non-responsiveness of y t — y t ~\ 
to yt.-i near zero, which is the equilibrium value in this model. This 
type of adjustment was employed by Michael, Nobay and Peel (1997) to 
model deviations of real exchange rates from equilibrium. It captures 
the idea that there is little response of real exchange rates to deviations 
from equilibrium when they are small but adjustment is proportionately 
faster the greater the deviation. 

The ESTAR model also has the property, unlike the bilinear or 
threshold model that it can exhibit multiple equilibria. In fact (78) 
can exhibit up to three equilibrium values. This is readily shown by 
taking the special case 

yt = aiy t -ie~' y( ' yt - 1 ' >2 + u t (79) 

Ignoring the error term, the equilibrium of the deterministic compo¬ 
nent of (79) is given by 

y t =y t _ l =y (80) 

Setting y t = yt- 1 = y in (79) we obtain the equilibrium solutions 
y = 0, =F( J y i ) so that for 0 < a\ < 1 the model has three equilibria. 
Not surprisingly in the general case of the ESTAR model (76) the model 
can also exhibit one, two or three equilibria which can be non-zero and 
different. Their stability properties can also differ so that for instance 
the lowest of the three equilibria could be unstable in one model but 
stable in another (see Byers and Peel, 2000). 

The ESTAR model can also capture cyclical behaviour. This is illus¬ 
trated in figure 18.3 where we have plotted observations generated from 
(i/i = 0.1, first 600 points discarded) 

y t = Ay t _ x e~ 0 - 1{vt - lY (81) 

In some circumstances such as “arbitrage” the symmetric nature of 
adjustment in the ESTAR model may make it the preferred nonlinear 
model. However symmetric adjustment between regimes may not be a 
property of the data or theory. A model which allows for asymmetric ad¬ 
justment but can also exhibit multiple equilibria and cyclical behaviour 
is the LSTAR model. 

The LSTAR model 

In the LSTAR model the transition function F t in (76) is given the 
logistic form 


1 -|- e.-i(.vt-d-X) 


(82) 
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Figure 18.3: ESTAR cyclical model 


where 7 > 0 . 

Differences between the ESTAR and LSTAR transition forms are il¬ 
lustrated in figures 18.4a,b and 5a,b where 1 — F t is plotted against y t -\ 
(the same figure would be obtained for yt-d )• In both cases as 7 gets 
larger the transition between the two extreme regimes F t = 0, F t = 1 . 
becomes quicker. Whilst the ESTAR transition is symmetric this is not 
the case for the LSTAR. We also notice that as 7 gets larger in the case 
of the LSTAR the transition from F t = 0 or 1 becomes almost instan¬ 
taneous as y t -d differs from A (set at zero in the figures). Consequently 
the LSTAR can approximate a two-regime SE(TAR) threshold model 
unlike the ESTAR model. 




*-» K-1) 

18.4a 7 = 2 18.4b 7 = 10 

Figure 18.4: LSTAR transition 


A further generalization of the transition function which can capture 
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18.5a 7 = 1 18.5b 7 = 10 

Figure 18.5: ESTAR transition 


a four-regime SETAR model as a special case is the quadratic logistic 
function proposed by Jansen and Terasvirta(1996). 

This takes the form 

^ i-| - e -i(yt-d-\i){yt-d-\2) ( 83 ) 

where 7 > 0 and Ai, A 2 are constants. 

The quadratic logistic function is very flexible and can approximate 
as special cases both the ESTAR and LSTAR by choice of parameter. 
Figures 18.6a and b provide a couple of shapes. 



18.6a 


18.6b 


Figure 18.6: Quadratic transition 


The smooth-transition models have been extensively employed in the 
last few years to model both macroeconomic and financial data sets. 
They appear to provide a parsimonious fit to many economic time series. 
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In the specifications of the threshold and smooth-transition models 
outlined above we have illustrated by assuming that the transition state 
s was a function of past values of y, ‘self-exciting models’. More gen¬ 
erally of course the transition state can be a function of either past ys 
or other variables x depending on the theoretical model underpinning 
the analysis. For example the transition variable in the analysis of in¬ 
flation might involve political variables if political parties were thought 
to have different policies. An alternative assumption is that the regime 
that occurs at time t cannot be observed since it is determined by an 
unobservable process which we can call s. 

In the case of two regimes we postulate, as with the case of the thresh¬ 
old model of Tong that the outcome follows a different n-autoregressive 
process dependent upon the state of the world which we can call states, 
St = 1 and s t = 2. 

In order to give content to the model we need to specify the prop¬ 
erties of the process generating st, the model most widely employed 
by economists is the Markov-Switching model developed by Hamilton 
(1989). 

The MARKOV-switching Model 

Suppose in both regimes the process follows an AR(2) process we have 
yt = Vo + otoiyt-1 + otmyt-2 + U t if St = 1 

Vt =Vi+ aiiyt-i + 01222 / 4-2 + u t if s t = 2 (84) 

The process s t is assumed to follow a first-order Markov process. 

If we assume that the probability that St equals j (= 1 or 2 in our 
example) depends on the past only through the most recent value s t -i 
(call it * = 1 or 2) 

prob(s t = j | s t -i = i) = prob{s t = j \ s t - 1 = *) = Pij (85) 

Such a process is described as a Markov chain with transition prob¬ 
abilities {p^, j = 1, 2}. The transition probability Pij is the probability 
that state i will be followed by state j. 

Writing out in full we have 

prob(s 4 = 1 | St_i = 1) = Pii, 

prob(st = 2 | st_i = 1) =pia, (86) 


prob(st = 1 | St_i = 2) = P 21 
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prob(s 4 = 2 | s t -1 = 2) = p 22 , 

Because probabilities sum to unity we must have that 

Pn T P12 - 1 1 (87) 

and 


P21 +P22 — 1 ( 88 ) 

Further properties of Markov chains and issues of estimation are set 
out in Hamilton (1994) and Timmermann (2000). 

Markov-switching models are an appealing way of dealing with peso 
problems (as discussed in chapter 14). Naturally if the transition proba¬ 
bilities are specified as functions of observed variables, as by for instance 
Diebold (1994), the models have similarities to the smooth-transition 
models. 


STATIONARITY VERSUS NONSTATIONARY 
VARIABLES 

In the discussion so far we have assumed that the variable y is stationary. 
If a series is non-stationary then its mean, variance and covariances are 
changing, an important violation of these earlier models’ assumptions. 
It will be clear from plotting many series that they exhibit a trend and 
hence are non-stationary. Plots of other series may be more problematic 
and it is clear we require formal tests to determine whether a variable 
is stationary and the appropriate test to render it stationary. Obvious 
filters to render a variable stationary are to remove the trend by regres¬ 
sion so that the deviations around trend are stationary or to difference 
the series. In fact the method employed to remove trend has crucial 
differences as we now illustrate. 

Consider removing a trend by regression of the variable 011 a time 
trend. We have 


Ut = a + /3t + u t (89) 

where a and (3 are the intercept and slope terms, t is the time trend 
and u t is a random disturbance which we assume to be white noise but 
can be any stationary ARMA process. Here the non-stationary variable 
y is represented by what is known as a deterministic trend, that is, the 
trend itself is perfectly predictable and is non-stochastic. Of course the 
addition of the stationary variable Ut results in variation about the trend 
so that y is stochastic, but in the long run the only information we have 
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about \jt is its mean (a + fit). Consequently, neither current nor past 
events alter long-term forecasts for this process. It is this feature which 
makes the trend deterministic. It also follows that the long-term forecast 
error will have finite variance and therefore, as pointed out by Nelson 
and Plosser (1982) uncertainty is bounded even in the indefinitely distant 
future. 

An alternative approach to dealing with trends is to difference a series 
to make it stationary. A process that needs to be differenced to make it 
stationary is known as an integrated process. An integrated process is 
said to exhibit a stochastic trend. 

When a series is integrated and one fits an ARMA process to the 
differenced series, the process for the ‘raw data’ is known as an ARIMA 
(p, d, q ) where d is the number of times the series has to be differenced to 
make it stationary (0, 1, 2 etc., though it can take non-integer numbers 
— a so-called fractional process, see below): p is the order of the autore¬ 
gressive component and q the order of the moving-average component. 

While the effect is to model the general movement of the series, there 
is a crucial difference between the two methods. To illustrate this differ¬ 
ence, suppose y t is given by the nonstationary process: 

Vt — P + Ut-1 + Ut (90) 

where u is white noise and the variable x t = Vt ~ Vt. , = P + u t is 
stationary, so y t is an ARIMA(0, 1, 0) 

(90) can be written by repeated substitution as 

Vt = yo + fit + Ut + Ut-1 + Ut -2 + Ut -3 + ... (91) 

where yo is some starting value in the past and the errors run back to 
this point. By comparison of (91) with (89) instead of a there is yo, and 
instead of the single disturbance u there is the sum of t disturbances. 
This illustrates that the two types of non-stationary process, namely 
the deterministic trend and the integrated process, have broadly similar 
forms, in particular the time trend. However, (91) is crucially different 
from (89). In (89) a is a fixed parameter whilst in (91) it is given by a 
past historical event, the starting point. In (89) the deviations from the 
trend are stationary whilst in (91) the error variance increases without 
bound as time passes. Consequently, the long-term forecast of y t from 
(91) will be influenced by the starting value and the variance of the fore¬ 
cast error will increase without bound as the forecast horizon lengthens. 
The economic importance of this distiction is crucial. If we model a vari¬ 
able as having a deterministic trend (being ‘trend-stationary’), we have 
made the assumption that shocks have a temporary impact on the vari¬ 
able in question. If we assume that a variable is an integrated process 
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and has to be differenced to make it stationary we are assuming that 
shocks have a permanent impact on the variable. 

Given these rather different properties, it is important to know for 
any particular noil-stationary series whether it is described by a de¬ 
terministic trend or whether it is an integrated process described by a 
stochastic trend. Nelson and Kang (1984) investigate the implications 

of making the wrong choice. They show that if a variable described by 

_2 

the process (90) is regressed on time, the R value will approach one as 
the sample size increases. Also, if the series is detrended, the autocor¬ 
relation function is determined entirely by the sample size, exhibiting 
high positive autocorrelation at low lags and spurious cyclical patterns 
at long lags. 

One area where these issues created much debate is whether real 
output is more appropriately described by a deterministic or integrated 
process (see Campbell and Mankiw, 1987). Currently, the available em¬ 
pirical evidence supports the view that many nonstationary economic 
time series appear to be generated by integrated processes (see Nelson 
and Plosser, 1982 and Phillips and Perron, 1988). Statistical tests to 
determine whether a non-stationary series is best described by a deter¬ 
ministic or integrated process have been developed by Dickey and Fuller 
(1981), Phillips and Perron (1988), Kwiatkowski et al. (1992). 

Cointegration 

In our discussion of non-stationary variables above we introduced the 
idea of an integrated series as being one which, after differencing, be¬ 
comes stationary. If a stationary series is defined to be integrated of 
order zero, or 1(0), a series Xt, is said to be integrated of order d if it 
becomes stationary after differencing d times or 

(1 -L) d x t =y t (92) 

where L is the lag operator and y t is stationary. To reiterate, the reason 
stationarity is important is because it is one of the basic assumptions 
made in modelling and forecasting. If a series is not stationary, then 
its mean, variance and covariances are changing so that models which 
assume these are constants will be misleading. For example, the ‘random 
walk’ series y t , given by y t = yt -i +u t where u t is a random disturbance, 
has an infinite variance. Any estimated variance will be finite and so 
must be an underestimate of the true value. It is also easily seen that 
y t , is J(l), and we note that if we add another 1(0) variable to y t the 
result will also be an 1(1) process. 

These considerations are also important where relationships between 
different series are concerned. For example, Phillips (1987) has demon- 
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strated that the standard f-tests in regression are not valid when the 
variables are non-stationary. More generally, Engle and Granger (1987) 
discuss the joint properties of integrated series. While the results ap¬ 
ply for any number of series, we will limit consideration to just two. 
Suppose two series x t , and y t are both 1(d), then in general any linear 
combination of them will also be 1(d), there is no relationship between 

the variables so the regression error will have infinite variance, though 

_2 

as noted by Engle and Granger the R will often be rather high — a 
spurious lit. However, it sometimes occurs that a combination of two or 
more 1(d)) variables is I(d — b) where b > 0. More formally, if a new 
variable, e t , can be defined by 


e t =y t - Ax t (93) 

where e t is I(d — b) then x t and y t are said to be cointegrated and A 
is called the constant of cointegration (or in the case of more than two 
variables the vector of constants) so that it becomes the cointegrating re¬ 
gression equation. So for example if real dividends are a non-stationary 
7(1) variable then for stock markets to be efficient under standard as¬ 
sumptions we must find that the real stock price is an 7(1) process and 
also cointegrated with real dividends so that the residual from a regres¬ 
sion of real stock prices on real dividends is an 1(0) variable. 

We now consider some of the properties of cointegrated variables. 
First, including a constant in a cointegrating regression does not have 
any effect. Second, if two variables are cointegrated then the constant of 
cointegration is unique (though not for n variables) This can be shown 
by replacing A in (93) by a new constant k which equals A + g. This 
gives 

et=yt- kxt = y t - Ax t - gx t (94) 

If y t — Ax t is stationary and x t is 1(1) only if g = 0 can (93) and (94) 
be consistent. Third, it has been proved that cointegration in levels of 
variables implies cointegration in their logarithms, while cointegration in 
logarithms of variables does not imply cointegration in levels. This point 
suggests that unless there are strong theoretical reasons for choosing a 
particular functional form then some experimentation with non-linear 
transformations of the variables in the cointegrating relationship is de¬ 
sirable. Fourth, cointegration implies the two variables will not drift 
apart, since e t , which measures the gap between y and x, and can be 
regarded as the ‘error’, is stationary with a mean of zero. This results 
in one interpretation of 


Ut ~ Ax t = 0 


(95) 
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as the long-run or equilibrium relationship between x and y. However in 
the case where say x t was a forecast of y t which applies in some efficient 
market tests the long-run equilibrium interpretation is not valid (e.g. 
Campbell and Shiller, 1988a,b). 

Fifth, Engle and Granger (1987) have proved that if x and y are 
both 7(1), have constant means and are cointegrated, then an ‘error- 
correcting’ data generating mechanism or error correction model (ECM) 
exists which takes the form 

A y t =y t - y t - 1 = Po~ Pi e t-i + lagged(Ay , Ax) + u t (96) 

and 

A x t = x t - x t -i = p 3 - p 4 e t _i + lagged(Ay, Ax) + ry (97) 

where e t is given by (93) above, and u t and rg are stationary error 
processes often taken to be white noise in practice though this is not 
required; also the errors could be correlated. In addition so that stability 
is ensured we require 

I Pi I + I P 4 M 0 (98) 

This last condition ensures that et-i occurs in at least one of the 
equations. The validity of (96) and (97) follows from x and y being 
7(1), so their differences are 1(0) and each term is therefore 1(0). It 
is also the case that not only must cointegrated variables obey such a 
model, but the reverse is true: data generated by an ECM must also be 
cointegrated. 

This result is of great importance because it links two previously 
separate areas: time-series models and ECMs. Error-correction models 
have been widely used in economics since Sargan (1964) and Davidson, 
Hendry, Srba and Yeo (1978). They require that the economic system 
has a well-defined equilibrium and that the speed of movement of vari¬ 
ables towards the equilibrium will reflect the distance the system is away 
from that equilibrium. Consequently, (95), the cointegrating equation, 
reflects the specification of the equilibrium of the system. The absolute 
value of the cointegrating residual et-i measures the distance from the 
equilibrium in the previous period. 

These results have implications for modelling and forecasting since 
the order of integration of the dependent variable in a regression must be 
equal to a linear combination of the explanatory variables as a necessary 
condition for a meaningful relationship between the variables to exist. 
In other words for equations to have 1(0) errors, the variables must 
also be 1(0) or, if 1(1), they must be cointegrated. This illustrates the 
need to examine the degree of integration of each candidate variable in 
a proposed model prior to estimation. 
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In order to provide further insight into the above discussion we show 
how the ECM can be derived from the data-generating process for two 
variables employing a simple example from Engle and Granger (1987). 

The data-generating process is given by 

x t + Py t = u t (99) 

x t +ay t = e t (100) 

where a, (3 are constants and the error processes u t and e t are given by 

u t =u t -i+v lt (101) 


e t = pe t ~i + v 2t with | p \< 1 


( 102 ) 


p is a constant and vu and V 21 . are error terms which have zero expected 
value and constant variances though quite feasibly non-zero covariance. 
Solving the pair of simultaneous equations for x t and y t from the above 
system with gives 


aut _ Pet 

(a — P) (a — p) 


(103) 


-tit e t 

(a - P) (a - P) 


(104) 


Since Ut is a non-stationary process (from (101) it is a random walk) 
we observe from (103) and (104) that x t and y t are integrated processes 
of order one, 7(1), since they depend on u t and the stationary 7(0) 
error process e*. However from (100) we observe that the two variables 
are cointegrated since a linear combination of the two variables, with 
cointegrating vector (1, a) is stationary. 

If we difference equation (99) and (100) and employ (101) and (102) 
we can obtain 


A x t + PAy t = vut 


(105) 


A x t + aAy t = Ae t = e t - e t _i = pe t -1 + v 2t - e t -i 

= v 2 t - (1 - p)e t -i (106) 


Solving for Axt and Ay t from (105) and (106) we obtain the error- 
correction forms 


Ay t 


i’2t - vu 

a — P 


(! ~p)et- 


a — (3 


i 


(107) 
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A x t 


avit - (3v2t (3{ 1 - p)e t -1 

a — (3 a — (3 


(108) 


If (3 = 0 in the above system so that xt is an exogenous process we 
could write (107) and (108) in the equivalent form 


A y t = 


v 2 t -A x t 

a a 


(1 -p)e t ~i 
a 


(109) 


A x t 


avit 

a 


( 110 ) 


The form (107) might be estimated for purposes of explanation and 
the form (109) for forecasting. 

As the error processes in (101) and(102) are made more complex we 
obtain the general form of linear error-correction mechanisms given by 
equations (96) and (97). 

We observe here that Fama estimated an error-correct ion form in his 
tests of efficiency of the exchange market discussed in chapter 14. If 
the spot rate and forward rate are integrated processes then efficiency, 
assuming a stationary risk premium, implies they are cointegrated. Con¬ 
sequently in the regression 


St — <5o + Sift + ft 


( 111 ) 


where s t is the spot rate and f t the forward rate we would expect to 
find that the error e t is 1(0) regardless of the order of integration of s t 
and ft (though typically found to be 7(1) processes these could take 
higher values, for instance in times of hyperinflation if inflation is an 
1(1) process). Early tests of efficiency examined whether the estimate 
of 8i was significantly different from unity. It was then recognised that 
inefficient forms of expectation formation could be consistent with this 
property as long as expectation forecast errors were stationary. The 
equation Fama estimated for 7(1) data to test efficiency, namely 

As t = 7 0 +7 i(/ 4 _i - s t _i) +u t (112) 

is a restricted form of ECM (restricted in the sense that he could have 
added further lagged changes in forward rates and spot rates and tested 
whether their coefficient values too were significantly different from zero). 


Nonlinear Error Correction 


While the above derivation of the ECM employed a linear system for 
ease of exposition there is nothing in statistical theory that prevents 
the ECM from having a nonlinear structure and indeed much economic 
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theory suggests it should have. Consequently much recent research has 
investigated the efficacy of nonlinear error-correction forms — see e.g. 
Enders and Granger (1998); Peel and Taylor (2002). 

These have generally employed some form of threshold or smooth- 
transition threshold structure since these are implied by transactions or 
arbitrage bands. So, for instance, we might suppose that adjustment to 
deviations from say purchasing power parity or equilibrium money de¬ 
mand is nonlinear due to transactions bands. Peel and Davidson (1998) 
also employ a Bilinear ECM where the economic rationale is less clear (it 
could be where adjustment speeds are directly related to the magnitudes 
of shocks in the system). A smooth-transtion ECM for an 7(1) variable, 
y t . which is cointegrated with ay would take the form 


Aty = 7o + 7 , A*Axt_j + Set-i + 

i —1 i —0 

( o p 

(1 — F t ) < 7 10 + 7 lt A y t —i + Ai,:Aay_,: + 8\e t ~ 


i=l 


i =o 


(113) 


where the 7, A, 6 are constants and e is the cointegration error. The 
lag on the cointegrating residual is often one but need not be. F t is the 
transition function as in the various threshold models outlined above. 
The transition variable, .s t , in the transition function is an J(0) variable, 
often the lagged cointegrating variable but not restricted to this variable. 


Vector Autoregressive Models 


If we consider equations (107) and (108) above we could write them in 
the equivalent form 


Vt 


i’2t - Vlt 
a — 3 


(! ~P) 

a— (3 


Xt.-l + (1 — 


(1 - P)a 

a — /3 


)Vt 


1 


(114) 


av lt -pv 2t , 3(1 -p) a/3(l-p) 

x t =- n -b (H-H-3— 

a — p a — p a — 3 

In this form the sytem would be known as a vector autoregressive 
model (VAR). 

Sims (1980) as part of a critique of conventional model building in 
macroeconomics proposed the estimation of unrestricted reduced forms 
in which all the variables in a chosen system are treated as being en¬ 
dogenous for use in forecasting (and policy evaluation). The linear VAR 
has the form 


y^t f(yit— 11 y^t— 2 > * • • >yit p 5 y~ 2 t 1 ? * * • ? y 2 t p 3 * * • 5 2/fei—> ykt—p) 
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2 / 2 i / {Uit-i > Vit -2 i ' • ■ > Uu-p, 2 / 2 t — 1 5 • ■ ■ > 2 / 2 t—pi ■ ■ ■ i Vkt- 1 , ■ ■ ■ ■ Vkt—p) 

(116) 

2 Ikt = /(2/if-i i 2/1 1-2 1 • • • > VU—pi V' 2 t — 1 j • • • 1 2/21—• • • 1 Vkt- 1 , • ■ ■ 1 Vkt—p) 

where there are 2c variables and the common lag length p. Notice that 
the right hand side variables are the same in each equation and do not 
include any current values. In these circumstances ordinary least squares 
is a suitable estimation method. 

Clearly if we estimate a VAR we must determine the form in which 
variables should enter the VAR. Sometimes researchers employ first dif¬ 
ferences of variables since this makes them stationary. Clearly if variables 
in the system are cointegrated this would imply that information on the 
equilibrium of the system, the cointegrating residual, has not been em¬ 
ployed so that the VAR is misspecified. A model estimated in levels 
would implicitly capture the cointegration restriction. Fuller discussion 
of VARs is found in, for example, Hamilton (1994). 

Fractional Integration or Long Memory 

So far we have assumed that the filter to make an integrated series sta¬ 
tionary is an integer number, 1, 2, etc. However Granger and Joyeux 
(1980) and Hosking (1981) have introduced the idea of fractional inte¬ 
gration. 

The fractionally integrated time series model can be defined as 

(1 -L) d y t =u t (117) 

where lit is a ARMA(p, q) process and d is a 11011 -integer where — l<d< 

1 . 

Such a process is called an ARFIMA(p, d, q) process (see Diebold and 
Rudebusch, 1989). 

The difference operator on the left-hand side of (117) can be expanded 
as a binomial expansion to give 

(1 - L) d = 1 + (-d) L + id (d - 1) L 2 + id (d - 1) (d - 2)^ L 3 + 

ld(d-l)(d-2)(d-3)L 4 + ... (118) 

We observe from (118) that y t is a function of all past values of y. 
For 0 < d < | the autocorrelation function, ACF, for y declines at a 
much slower hyperbolic rate towards zero than the ultimately exponen¬ 
tial decay of a standard ARMA process. The hyperbolic decay of the 
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memory of a fractionally integrated process enables ARFIMA processes 
to model dependence between observations at long range. 

When d > i the process is nonstationary. But at the same time, in 
contrast to the 1(1) (or unit root) case, these processes exhibit eventual 
mean reversion. 

The formula in (117) is also valid for — 1 < d < 0, which can be 
thought of as the case obtained by the simple differencing of a fractionally 
integrated process. 

Because fractional integrated series can be nonstationary there is a 
developing literature on fractional cointegration (see Davidson, 2000) 
and all the associated issues of spurious correlation. 

A large number of authors have recently reported empirical results 
which suggest that a number of macroeconomic and financial time series 
exhibit autocorrelation functions that decay hyperbolically rather than 
geometrically and can be described by fractionally integrated ARFIMA 
( p,d,q ) processes — e.g. Baillie et al. (1996); Ding, Granger and Engle 
(1993); Ding and Granger (1996). 

The rationale for why economic time series should follow fractional 
processes is less clear. Granger (1980) provides an explanation whereby 
aggregation over heterogeneous micro agents who display different de¬ 
grees of persistence can generate an aggregate series which displays the 
fractional property. This rationale was exploited by Byers, Davison and 
Peel (1997) in their analysis of political popularity. It is less clear that 
this rationale can explain the apparent fractional property of macroe¬ 
conomic series. Recent work is suggestive that the apparent fractional 
property of some macroeconomic data may be a misleading linear prop¬ 
erty. Granger and Terasvirta (1999) and Byers and Peel (2000) show 
how nonlinear models can exhibit outcomes that appear fractional. For 
instance the LSTAR and ESTAR models can generate data which ap¬ 
pears to have the long memory property. Also Engle and Smith (1998) 
and Granger and Hyung (1999) show how structural breaks or Markov- 
switching models can generate data which appears to have the d prop¬ 
erty. It seems clear that there will be a lot more research devoted to 
explanation and analysis of data with the d property. 


TIME-VARYING VOLATILITY 


In the univariate models considered above namely 

yt = f(yt-i,yt- 2 , ...)+u t 


(119) 
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we have assumed that the error had expectation zero and was white noise 
or a Gaussian white noise process. 3 Also that the error had constant 
variance both unconditionally, E(u 2 ), and conditionally, E(u f | 
so that E(u 2 ) = E(ii 2 \ Qt-i) — <r 2 where <r 2 is the constant variance 
and Cl t -1 is the information set consisting of all relevant information up 
to and including time t — 1. 

It has been noted for a long time that the squared residuals from 
fitted ARIMA processes for many financial time series exhibited serial 
correlation e.g. Mandelbrot (1963a,b). The same finding was also dis¬ 
covered later for many macroeconomic time series (see e.g. Engle, 1982). 

Such a finding could be evidence of mispecification of the mean func¬ 
tion. For instance we fitted an AR(1) process to the bilinear process: 

Ut = ayt- i + Pyt-iUt-i + u t (120) 

where a = 0.5, f3 = 0.4 and u t is Gaussian white noise, N(0, 1). We 
obtained a first-order autocorrelation coefficient for the squared value of 
the residuals from a fitted AR(1) process of 0.28 in a sample of 400 (the 
first 600 observations were discarded). This value is significant. 

However there are numerous models based for instance on diffusion of 
information flows in financial markets or pricing of risk in asset markets 
which provide a rationale for serially correlated squared residuals (see e.g. 
Bollerslev, Chou and Kroner(1992)). We should note of course that a 


! Having discussed some nonlinear processes we can be clearer about 
this distinction. A process having the properties E(ut) = 0, E(uf) = 

a 2 =constant, E(ut—i, ut— q ) = 0 for i A q is known as white noise. Consider now the 
example taken from Priestley (1988): 


ut = vt + Pvt-iVt -2 (al) 

where vt is white noise but also independent of all past information. It follows that 
E(ut) = 0 (since 0E{vt—\Vt—2) = 0), also E(u = A). The autocovariance function 
is given by 


E(u t ut+s) = E{v t v t +s + /3vt-ivt-2Vt+ 3 + f3v t vt+s-ivt+s-2 + 

/3 2 v t - 1 vt- 2 vt+s-ivt+s-2} (a 2 ) 

For all s 7 ^ 0 each of the terms in (a2) has zero expectation so that ut is an 
uncorrelated process. Consequently it is a white noise process. However because ut 
has the nonlinear process specified in (al) we can construct a forecast of ut +1 which 

is non-zero and given by ut- |_i= (3vtvt—i- Consequently a stronger condition than 
assuming a white noise error is to assume the error is independent of all previous 
information (i.e linearly and non-linearly). Such a process is called an independent 
white noise process. When the variance of the error is also normally distributed the 
independent white noise process is called Gaussian white noise. 
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predictable pattern of the variance of asset prices would not imply market 
inefficiency. Rather the issue would be whether the econometricians’ 
forecasts of variance are superior to those embodied in e.g. option prices. 

Models of time-varying volatility relax the assumption that the con¬ 
ditional variance is constant. It is assumed that the conditional variance 

E(u 2 | fl t - 1) = h t (121) 

where ht. is some non-negative function. 

Engle(1982) specified the process 

u t = z t ht' 5 (122) 

where z t is independent and identically distributed with zero mean and 
unit variance. In early work it was also assumed that 2 was normally 
distributed, which we also assume, but this assumption can be relaxed. 

From (122) and the properties of z, it follows that the distribution 
of Ut. conditional on the information set is normal with mean zero and 
variance ht- We can also derive that the unconditional variance of u t is 
constant since 

E(u 2 ) = E{E(u 2 | SVO) = E(u 2 | n t -i) = h t (123) 

The distinguishing features of time varying volatility models is in 
the specification of how h t evolves over time. Engle(1982) proposed 
AutoRegressive Conditionally Heteroscedastic (ARCH) models, where 
the conditional variance is a linear function of past squared errors. The 
ARCH model has the form 

ht = 7o + 7i w t-i + l 2 u t -2 + ■■■ ( 124 ) 

where the 7 are nonzero constants to ensure ht is positive. In a financial 
market setting the idea of (124) would be that turbulent periods of large 
u 2 are followed by further turbulence, so-called volatility clustering. For 
certain applications the assumption of long lags in the squared errors may 
be more parsimoniously captured by the form proposed by Bollerslev 
(1986) which is the Generalised ARCH or GARCH(p, q) process given 
by 

q p 

h i = 70 + a i h t~i ( 125 ) 

1=1 i =1 

The reader will have noticed the similarity of (124) to an MA process 
and (125) to an ARMA process. If we define 6 t = u 2 — h t . = h t {z 2 — 1), 
obtained by squaring (122), we can write for instance a GARCH(1, 1) 
process 

ht = 7o + Pi u t-i + «i h t - 


1 


( 126 ) 
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equivalently as 

= 7 0 + (J3 1 + + Ot — a\6 t _i (127) 

From (127) we observe that we can write the GARC'H(1,1) process 
as an ARMA(1,1) process in the squared errors. Note also that E(6 t \ 
f2t-i) = 0. As with an ARMA(1,1) process we would require /3 1 +ai < 1 
for stationarity in (127). 

As nonlinear models in the conditional mean function discussed above 
were natural extensions of the ARMA processes so nonlinear models of 
time-varying volatility have been developed. Similarly mirroring exten¬ 
sions of the linear model there are integrated GARCH processes and frac¬ 
tionally integrated Garch processes (FIGARCH).These different models 
allow for asymmetric impacts of shocks on conditional volatility, long 
memory, hyperbolically declining impacts, etc. These methods have 
been extensively employed in univariate modelling of series in macroeco¬ 
nomics and finance and are well covered along with issues of estimation 
and statistical properties in Franses and van Dijk (2000). 

Chaos 

In the models discussed above it has been assumed that the models are 
stochastic. In the last decade or so there has been discussion of whether 
outcomes in economics, particularly asset markets and macroeconomics, 
might be the product of a chaotic process. A chaotic process is a de¬ 
terministic process, so that the explanation of outcomes does not rely 
on shocks. Perhaps the best way to illustrate the idea is to consider the 
nonlinear difference equation 

Dt = Ay t ~i(l — t/t-i) (128) 

where A is a constant, 0 < A < 4. 

The deterministic nonlinear difference equation (128) has been termed 
the simplest nonlinear difference equation. It can be obtained as a re¬ 
duced form from the more general second-order difference equation 

Z t = a + bZ t _ i + cZ t _ 2 (129) 

by letting 4 Z t = A y t + 7 . 


4 By substitution 
C7 2 + (6 — 1)7 + a = 0 
A = — ^ 

A = 2c7 + b 
and 

A 2 - 2A + 2b - b 2 + 4ac = 0 
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In order to analyse the dynamic behaviour of equation (128) we first 
determine the points of equilibrium (y) obtained where y t = yt- 1 = y 
in (128).We find by substitution that there are two equilibrium points 
given by 

V = 0, and y = ^ (130) 

The slope of the function is given by ■ This is equal to 

-P*-=A-2Ay t _ 1 (131) 

dyt- 1 

(131) implies that the graph of y t against yt-i has a maximum value 
at yt —i 2 * 

In figure 18.7a we plot the the relationship between y t and y t - 1 - 
called the phase curve — in the range 0 < yt -1 < 1 for three values of 
A. The graph is liill-shaped and known as a tent map. By changing 
the parameter A, we can adjust the height and steepness of the hill. It 
turns out that the ‘tuning parameter’ A is crucial for the dynamics of 
the system. The value of A is also sometimes called the ‘knob setting’. 
In order to illustrate the dynamic behaviour of the system we can draw 
a 45° line to give the points where y t = y t -\ = y. Clearly the points of 
intersection of the graph of (128) and the 45° line are the equilibrium 
points given by (130). This is illustrated in figure 18.7b with A — 2. To 
show how the dynamics of the system evolve we start the system at the 
arbitary non-equilibrium initial value in the unit interval, yo = 0.2. This 
value generates a value of y( 1) = 0.32 in the next time period (point 
B). We now wish to repeat the process, this time commencing from 
y(l). To do this we move horizontally to point C on the 45° line. The 
point vertically below C is y{ 1) on the horizontal axis, because the two 
coordinates of any point of the 45° line must be equal. From y(l), we 
move directly to point D on the phase curve. Continuing in this fashion 
we will trace out the path of y. In this case the path converges to the 
equilibrium point K where y = 0.5. 

In order that we converge on an equilibrium point it is necessary that 
the slope of the phase curve at the equilibrium point has modulus less 
than unity. (Note also that for any starting values of yt-i which do not 
begin in the unit interval (i.e. between 0 and 1) the system tends to 
negative infinity). 

The slope of the phase curve at the two equilibrium points is 

dy t 
dy t -i 


= A - 2Ay t - 1 


(132) 
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18.7a Phase curve 



Figure 18.7: Phase curve 



18.7b Phase curve, dynamic behaviour 



18.7d Segment of time-path 


= A when y = 0 (133) 

and 

= 2 — A when y = 1 — — (134) 

It follows that for 0 < A < 1, the equilibrium point of the system is 
zero since (133) has slope less unity whilst (134) has slope greater than 
unity. 

For the case 1 < A < 3, we find that the equilibrium point, 1 — 1 /A, 
is stable or ‘attracting’, while that for y is ‘repelling’ or unstable. In 
other words, any deviation from zero causes the system to move further 
away, whilst any paths in the neighbourhood of 1 — 1/A, converge on 
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the equilibrium point. We note further that for 2 < A < 3, dy t /dy t -i 
will be negative from (134), implying ultimately stable oscillations in 
the trajectory of y around the higher equilibrium point from any initial 
starting point apart from equilibrium. 

When A is greater than or equal to 3, the slope at the higher equilib¬ 
rium point becomes unstable. In the case of a linear equation if the slope 
is greater than one in absolute value the oscillations will be explosive (i.e. 
y t will tend to infinity). However, with the hill-shaped phase curve, this 
cannot occur because, as the oscillations increase, the path will even¬ 
tually ‘expand into’ the positively-sloping side of the hill and produce 
a value of yt+j closer to the equilibrium point than some earlier point 
in the trajectory j/t+j-fe- From this point the cyclical path will begin 
converging towards the equilibrium point. However, since the equilib¬ 
rium point is unstable, convergence cannot occur so that the path will 
once again at some point begin to diverge. This is illustrated in figure 
18.7c, with A = 3.5, where we note that starting from initial condition 
I/O, point C is closer to the equilibrium point (.K = 0.714) than is point 
E. 

As A is gradually increased between 3 and 4, a plethora of differ¬ 
ent dynamic paths of the system are exhibited. For A equal to or just 
greater than 3 the path converges on a stable two-period cycle. As A 
is gradually increased, the two-period limit cycle becomes unstable and 
exactly at this point a stable four-period cycle emerges (also illustrated 
in figure 18.7c). As we continue to increase A the four-period cycle itself 
becomes unstable and at this point is replaced by a stable eight-period 
cycle. As A is further increased the process continues until ultimately the 
path of y evolves into an infinite number of cycle lengths. Feigenbaum 
(1980) demonstrated that the interval of A over which a cycle is stable 
shrinks at a geometric rate. In particular, each interval is approximately 
4.669 times smaller than the previous interval. He also showed that this 
property is universal for all functions that have ‘hill-shaped’ maps (for 
example, a sine wave). The period doublings as A increases are known 
as ‘period doubling bifurcations’. At points of bifurcation the system 
under consideration changes discontinuously. This is one possible path 
to chaos. Although the cycles initially only have even periods it turns 
out that cycles with odd numbers of periods eventually begin to appear. 
Initially these will have very long periods, but in due course they are 
joined by cycles of shorter and shorter duration which encompass every 
positive odd integer . 

As we continue to increase A we find that there are essentially an 
infinite number of initial values of A for which the time path of A never 
repeats itself. When a system starting from an initial value never repeats 
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Row 

Value A 

2/o 

2/io 

2/23 

2/72 

1 

3.95000 

0.3000 

0.41538 

0.45516 

0.06589 

2 

3.95000 

0.3001 

0.38369 

0.69396 

0.09839 

3 

3.10000 

0.3000 

0.62339 

0.75924 

0.58104 

4 

3.10000 

0.3001 

0.62362 

0.75918 

0.58104 

5 

3.95001 

0.3000 

0.41471 

0.24035 

0.41117 


Table 18.1: Path of y in equation (128) 

itself it is said to exhibit aperiodic motion. Aperiodic motion does not 
imply that the observed patterns appear to have no structure. It is 
possible, for example, to distinguish patterns which look like cycles but 
which disappear after a number of periods. Systems exhibiting aperiodic 
motion can be thought of as chaotic. 

The different types of dynamic behaviour which we observe as we 
vary A illustrates the notion of an attractor. An attractor in a linear 
system is a point, more generally a subspace, to which the system con¬ 
verges. The fixed-point equilibrium is defined as an attractor of period 
one. Limit cycles of whatever period are also types of equilibrium of a 
system, and are known as periodic attractors. When the system is in 
the chaotic region we have another type of attractor known as a strange 
attractor. The strange attractor is an attractor for all paths in its neigh¬ 
bourhood. Unlike limit cycles, trajectories within the strange attractor 
are not periodic, and no point is ever visited twice. 

In figure 18.7d we plot a segment of the path for a value A = 3.95, 
which is within the chaotic region. We notice that y appears quite 
volatile but always remains within the 0-1 range. We also see that peri¬ 
ods of relatively large cyclical oscillations can be followed by periods of 
relative stability (e.g. observations around 2340). Such abrupt qualita¬ 
tive changes in the path of y are characteristic of chaotic systems (and 
some non-linear stochastic equations, for instance the bilinear simulation 
of figure 18.2b). 

Chaotic systems possess a number of other interesting properties. 
One is that the time path of y is highly sensitive to initial conditions 
(ijo in our systems). Table 18.1 illustrates this property. In row 1 we set 
out, for a value of A of 3.95 (within the chaotic region), the values that 
X takes after 10, 23 and 72 from a starting value of 0.3. In row 2 we set 
the starting value of A at 0.3001. We observe that after ten periods to 
live decimal places the paths are close together, but after 23 periods the 
values are becoming different. 

_2 

In fact, after 500 iterations the R correlation coefficient between the 
two series is close to zero at —0.02 and quickly approaches zero. In rows 
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3 and 4 we carry out a similar experiment with A = 3.10 (outside the 
chaotic region). Now the values of y remain close together and the R 
after 500 iterations is approximately 1.0. The sensitive dependence of 
the path of y on the initial conditions in a chaotic region of a model 
was termed the ‘butterfly effect’ by Lorenz (1963) in a reference to the 
possibility that if the weather system is chaotic, a butterfly flapping its 
wings may set off a sequence of meteorological events that result in tor¬ 
nadoes, the sensitivity of the system being such that minute changes in 
initial conditions give rise to outcomes which are dramatically different. 
We also note that since equation (128) is deterministic, the path of y 
is totally predictable if we know the exact initial condition. However, 
the sensitivity to initial conditions informs us that in all practical situa¬ 
tions, where there will almost inevitably be errors in measurement and 
rounding errors in computation, it will almost certainly not be possible 
to make meaningful long-term forecasts 

A second interesting feature of chaotic systems is that the time path 
exhibits sensitivity to small changes in parameter values. This is also 
illustrated in Table 18.1 where in row 5 we set out values of y at various 
points on the path for a value of A of 3.95001, and a starting value of 
0.3. The value can be contrasted with those in row 1 where A has the 
value 3.95. The values soon become quite different. 

A third feature of chaotic processes (shared by some other non- 
chaotic deterministic processes) is that the observed time paths appear 
to be random or those of a linear/non-linear system subject from time 
to time to ‘very large’ random shocks. Frank and Stengos (1988) illus¬ 
trate the first possibility by considering the autocorrelation function of 
equation (128) for the first 1000 iterations with yo = 0.3 and A = 4. 
On the basis of the usual statistics, we would accept that the series is 
random. No doubt the fact that chaotic data can appear random may 
have motivated, in part, the applications to financial data, given that 
high frequency data changes in asset prices are well described by a white 
noise process. 

It is certainly suprising to find the rich number of different paths that 
can be generated by the simple equation (128) including chaotic out¬ 
comes. The macroeconomy appears much more complex and the ques¬ 
tion that naturally arises is the potential applicability of chaos in macroe¬ 
conomics. A number of statistical methods now exist which, given a suffi¬ 
ciently large number of observations, can determine whether an observed 
time series is chaotic or has the characteristics of a chaotic series (see, 
e.g.,Grassberger and Procaccia, 1983, Brock, Hsieh and LeBaron,1991). 
One of the tests is based upon the correlation dimension. If one formu¬ 
lates and estimates a model then the residuals should be independent 
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of past information, i.e. unforecastable. The BDS test of Brock et 
al. (1996) is employed to test the null that the residuals have no re¬ 
maining structure. Essentially if a series is random then plotting the 
residual against the lagged residual (say normalised to lie between zero 
and one) would fill the square with points. Similarly in three dimen¬ 
sions plotting the residual against the first and second lags would fill 
the cube. And so on into n dimensions. The test is based on exam¬ 
ining whether the relevant space is filled. A chaotic process leaves an 
imprint. This is illustrated in figures 18.8a and b. Figure 18.8a is a scat¬ 
ter of 20,000 points of Gaussian white noise plotted against its lagged 
value. There is no relationship: the space will clearly be filled with 
enough observations. In figure 18.8b the scatter is the residuals from a 
fitted ESTAR process to 20,000 observations from the chaotic process 

y t = 3.97y t _i(l — y t -i).{yo = 0.1). The ESTAR process which has two 

_2 

lags of y has a fitted R of 0.9998, a very high fit to the data, neverthe¬ 
less we observe that the scatter of the residuals at t and t — 1 has not 
filled the space but rather left a distinctive pattern. 

Though wide use has been made of these test statistics, there is per¬ 
haps as yet no convincing empirical evidence of chaotic behaviour in 
the economic series examined to date (mostly asset prices or macroe¬ 
conomic series) — though see De Grauwe, Dewachter and Embrechts 
(1993). However, a number of theoretical models generating chaos have 
been developed — e.g. Benhabib (1992), Dechert (1996), Grandmont 
(1986). 

CONCLUSION 

The purpose in this chapter has been to present a necessarily brief intro¬ 
duction to some of the time-series methods employed in macroeconomics. 
We have not discussed statistical tests which can be found in the text 
books cited. Also we have not discussed (amongst other omissions) neu¬ 
ral nets or non-parametric methods. These are of interest but space 
precludes their discussion — see e.g. Franses and van Dijk (2000), Har- 
dle (1990). One value of these methods is in presenting the theoretician 
with much more information about the nature of the ‘stylized facts’ the 
theoretical models have to explain. 
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Figure 18.8: Chaotic data and ESTAR transformation 
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Errata - Advanced Macroeconomics, a Primer by Patrick Minford and David 
Peel (second impression) 


page 18, second para, equation should read p t — p t _= k\p t _ 1 — pl_i\ 
p 22, second line of box should read: Remembering that Alnxt = 
p 27 last line to read: pt + apt -1 + bpt -2 = 0 
p 92 second equation from bottom to read: <j> = 

p. 158 bottom line left hand side of equation, numerator to read: a — a 
p. 239 equation (1), bottom of page, to read: Y t = A t L^K\~ a 
p. 240 equation (4) to read: r t = (1 — a)-^; that is, k(r) = (yzyy) -1 

p. 243 equation (10) to read: Y t = (exp 

-1 

p. 290 equation (64) at bottom to read: cq = E 0 a[(3(l + ro)] 1_7(1 ~ p) ...(as 
before) 

p. 291 second line to read: where —Ai = —— 

C 1 _ 

line after equation (68) to read: where K 0 = as before 
p. 302 equation (8) denominator term in e t to read: 1 + a 2 (A — /?7 2 ) 

OO 

p. 316 seventh line to read: It maximizes E t= 0 ]T) ^ 4 u(c t ) subject to 


t =0 

■dt_i(l—7f) ' 
St 


p. 325 equation (16) term in s' t _ 1 to read j : 

equation (17) to read y t = (1 — lt) n d t (st — [1 — 7r]sJ' / ) 
p. 326 equation (22) rhs numerator to read: ....(1 — ltY~ 1 {st — [1 — 7r]s*') 


equation (23) lhs to read j 
equation (25) to read s t 


(1-y) 


p t r t 

= 1 = s': 


e t 

et+i 

St 


— °t 1 

p. 327 equation (28) lhs to read ...d t _i(l — Trjsjij 

p*_i....d*_i(l-7r)4_ t 


same equation rhs to read 


et 
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CORRIGENDA- Advanced Macroeconomics - A Primer (2002) 

The following errors have been discovered to this point (3/2003). They will 
be put right as soon as new printings permit. 

p.9, para. 2, line 2 should read ...(Wicksell, 1907)... 
p.28, line 2: denominator of a should be 1 + s — r 

p.62 line before eq. (67) reads ’ ...at t — 1 to obtain:’. Line after eq.(69) 
reads ’where Ai and A 2 are ...’ 

p.64: in Figure 2.5 the label on S’S’ should read (yt+i = p[yt — U*} + V*) 
Eq. (74) first line insert opening large square bracket before i? _1 ; and in 
third line insert the same before 1 — -r^—B^ 1 . 

p.65 in second line of eq.(78) the first expected price term sould read E t -\Pt+i 
p.66 Fig. 2.6 should be entitled The effect of an anticipated shock. 
p.67 end of first para: insert (Box 2.1 shows the algebraic workings). In Box 
2.1 below line 1, after ’illustration’ insert ’in Fig. 2.6’ 
p.69 first line should read ’sets A\ = 0, so that:’ 

3 lines before eq. (95) should read ’....on how this set (m t , E t rn t +i) changes 
at eact t.’ 

In equation (95) insert opening large square bracket before 1 — j^B^ 1 . 
p.71 line before eq. (105) should read ’Solving (104) for k -2 to obtain:’ 

In equations (100) and (101) insert opening large square bracket respectively 
after ... = —a and ... = — p(l + a) 

p.78: LHSofeq. (129) should read E t _ip t+ i. Insert lull stop after’practice’ 
in penultimate line. 

p.80 In eq. (5) RHS the symbol n should be inserted before (y t -1 — 
E t - 2 yt-i)- the same symbol should be inserted in the gap before the = in the 
next line; and again in eq (6) after the first = to read and again in 

the line below the equation to read ’ where A = 1 — k so that A = j.’ 

OO 

p.92 RHS of equation (6) should read n i e t-i + P* and RHS of equation 

i=0 

OO 

(7) J] +P* 

i —1 

p.99 In Figure 3.4 top righthand quadrant insert closing bracket after E t p t = 
P* 

p.101 In equation (29) the (j) should have v, e, u subscripted. 
p.102: 3 lines after eq. (35) insert opening large square bracket after — 
and at end of (36) summation below should read i = 0 

p.113 in eq. (17) last term on first line should be /3( 1 + —)(pt — E t _\p t ). In 
eq. (19) insert another bracket after /?(1 + -) 
p.114 eq. (22) LHS to read: ... — p(yt -1 — y*) 

OO 

p.115 eq. (24) RHS last term to read —a ( 7^+1 — 7 Ti)e t -i and eq. (27) to 

2=1 

read 0 = (1 + a)ni — mri + i 

p.123 3 lines before eq. (56) should read: ’Let us use (55) in place of (3), 
together with the rest of the model (2), (4) and (5); then...’ 


1 



p.150 para.2 line 6 : to read ...In period t + I ...Next equation RHS to read: 
... — 7Tt + i + ab or nt.+i = ab. Five lines on U w in place of u w and in next but 
one equation RHS to read ... — 6ab(n f ^\ — i r^) 

p.151 para. 3, 3rd line from end to read ’government does not...’ 
p.161 In Figure 5.7 after Effect of shocks the large bracket should read 
( (i .Along the left of the vertical axis, replace a 2 by a 2 and the axis 
label should be —EU in place of EU. The top line of the figure should be 
labelled —EU S in place of EU S . 

p.166 The line after eq (5) should read ’... ;<72 = (i + °)(i+o 5 | g) i •••’ 
p.167 sentence after eq( 8 ) line 2 should read: ’-in this instance it will be 
more or less table depending on /i.’ 

p. 177 in equations (28) and (29) and also in the first and third lines after 
eq. (29) the denominator expression throughout should be 1 — /3£(1 — £) 
p.196 eq (18) closing bracket to read (b' t A b') 
p. 230: eq. (13): insert large square bracket before 
p. 245: eq. (20): insert large square bracket before 1+ 
p. 263 on RHS of eq. (9) the first term should read rp,t 
p. 276 eq ( 6 ) middle bracket to read [A 0 + yo — Co] 
p. 277 eq(12) RHS should open and RHS of eq (14) (3 t+1 
p. 278: Eq. (16) RHS should read , ...ct+i)(pt+i.... / Two lines on should read 
’....is for (16) to hold’ Eq. (19), first line: insert large square bracket after E t . 
Five lines above should read ’Equations (16) and (17) therefore...’ 

p. 279 Eq ( 21 ) RHS should read b t+ i/Rt • Para before eq. (26) should read 
’Using the two budget constraints, (21) and (25), and fruit market clearing, 
(24), Walras’ ....’ In the next para to the end of the page the equation numbers 
in the text should be increased by 1: i.e (220 becomes (23), (18) becomes (19) 
etc. 

p. 280: eq. (30): LHS should read: qt[xo -< x t+ \ -< X\}. The euqation 
numbering in the text should be advanced by 1: i.e equation (15) becomes (16), 
(28) (29) and (29) (30). 

p. 283 Eq. (41) insert on RHS large square bracket before last expression. 
In following para last line becomes ’..as (38)’. 

T 

p. 285, 5 lines before eq. (44) expression should read /^ u ( c t )> 3 lines 

t= o 

before eq(44) opening part should read: Cj + 1% < + ... 

p. 288 all equation numbers in the text should be advanced by 1 . 
p. 289 equation (55) last term should read st.+i -4 

p. 291-2 in line after equation ( 68 ) read In Kq = + O.5.... Advance the 

equation numbers in the text by 1 (also at start of following page 292) 

p.298 in Figure 11.5 the scale should stretch higher so that 1.0 is higher than 
^ . In eq (24) ’subject to’ should read lim Rq T At = 0;... 

p.301 eq (3) should read V(y) = y 0 + hy + 0.5q 2 y 2 

p.302 line 2: delete open bracket after L t . Eq ( 8 ) in numerator of term on 
et it should be a not a 2 . 9 lines below the expression should begin —p 2 (\.... 3 
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lines below denominator of RHS should read 1 — pj3. 

p.309 eq (1) opening term after summation sign is /3*. RHS of eq (2) should 
read /3*tt'(c t ) — HtPt- The second line of eq (3) should read ■< 0 for 7n d = 0. 

In eq (4) the first line should read 0 = = = 0 for m T >- 0 and the 

second line should read -< 0 for = 0. 

p.3Il second line should read ’ ...will run these money...’ In Figure 12.1 
the indifference curve shown for B agents should be broken into two unrelated 
indifference curves. 

p.312 in rubric to Figure 12.2, last line should read ’...with growth, n.’ 
p.316 In eq (18) delete numbering (19) and (20) on next 2 lines. Second line 
of equation should read 

+jr t / l t t+1 ( x t+i) n (xt+i, x t)d x t+i - dt - lPt ^ lSt ~ 1 and 3rd line -r t (x t )s t -i - 
} Renumber next equation (19) 

p.317 Renumber first equation (20) and delete (23) on second line. Renumber 
next equation (21) 

p.318 Renumber the two equations (22) and (23). 

p 325 eq (16) RHS expression multiplying s' t _ x shouldread j Pt r * d± ~ 1 

p.326 eq (26) last expression should readp t d t (l — It)*] eq (27) last expression 
should read p$dt( 1 - I'D* 

p.327 eq (28) RHS second term should read s' t 

p.328-330 advance equation numbers in text by 1, except for first para of 
328 where (23) and chapter 10 eq (9) are correct. 

p.328 eq (34) RHS should read ...(A^-) _p . Eq (42) RHS should read c t [l + 

oP{RXR)~ a P]-^ 

p.329 eq (43) LHS should read ^ ■ Eq (44) LHS 

E t\RXR t+1 ) 


should read 


. 1+. J (iiXB l+2 )-^ 

l + n<r(HXR t + 1 )-rp 


Eti-j 


! U+1 ■ 


p.330 eq. in second line should read 


E t (l+ a ‘ T {RXR t+1 )-‘'>’) 

RXR t+1 ) 


P(^)d t RXR t {l+ 


a a (RXR t )- ap } + ci 

p.374 eq.(129) reads ....if B t < k. In eq. (130) insert large opening square 
bracket before 6. 

p.375 first para, line 1 reads ’ ..i.i.d Bernoulli’ and line 2 ’process (indepen¬ 
dent of «)...’ 

Second para, line 2 read ’...where B t < k. When...’ 

In equations (131) delete all large square brackets. 

In eq (133) insert large opening square bracket after: E t B t+ 1 = q 

Next para line 1 reads ’...when B t < k it grows at’ 

p. 510, reference for Lawrence H. White should be relocated as White, 
Lawrence H. on p. 526. 

p. 517, line after reference for Mueller, the next reference should read, on a 
separate line 
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Mundell, R.A. (1960) etc. 

p. 520 in Romer, D.H. (1996) reference, read New York: McGraw-Hill. 
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